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Nucleic Acids Encoding [Hi^Trp^Tyi^-GnRH Preprohormone and 
5 [Ser*]-GnRH Preprohormone and Their Uses 

BACKGROUND OF THE INVENTION 
This invention relates to DNA and protein 

10 compositions useful, for instance, in the regulation of 
reproductive function, and which are also useful in the 
development of transgenic animals with desirable reproductive 
characteristics. Specifically, this invention relates to DNA 
and protein compositions for the precursor protein for the 

15 chicken II form of gonadotropin- releasing hormone and to the 
[Ser 8 ] -GnRH preprohormone. 

Gonadotropin- releasing hormone (GnRH) is an important 
reproductive hormone in vertebrates. GnRH is a neuropeptide 
which regulates the production of follicle stimulating hormone 

20 (FSH) and luteinizing hormone (LH) . In mammals, GnRH is 

produced by the hypothalamus and regulates the release of FSH 
anH lh, which in turn regulate the development and function of 
the reproductive organs. 

There are eight different forms of GnRH that have 

25 been identified in different vertebrate species (see figure 
1.) For a review of the species distribution and properties 
of the different forms of GnRH, See Sherwood, N.M. , et al. 
(1993) Endocrine Rev. 14:241-254. For instance, the more 
recently evolved mammals have a form of GnRH designated 

30 mammalian GnRH. A number of primitive placental mammals, 

nonplacental mammal. s and other vertebrates have more than one 
form of GnRH. One form of GnRH that is particularly 
widespread in this latter group of animals and has been named 
chicken II GnRH, since it was the second form of GnRH that was 

35 discovered in the chicken. This form of GnRH is also known as 
[His 5 ,Trp 7 ,Tyr 8 ] -GnRH since it has amino acid substitutions in 
the 5, 7 and 8 positions as compared to mammalian GnRH (see 
figure 1) . 
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[His 5 , Trp 7 ,Tyr 8 ] -GnRH is a potent stimulator of 
reproductive function in a wide variety of species. It is 
particularly active in fish. [His 5 ,Trp 7 ,Tyr 8 ] -GnRH is the 
most potent form of GnRH in fish. {See Sherwood, N.M. , et al., 
5 supra and Ngamvongchon, S., et al. (1992) Segrulatoxy Peptides 
42:63-73.) [His 5 ,Trp 7 ,Tyr 8 ] -GnRH also apparently has 
significant biological activity in more recently evolved 
mammals . 

One of the problems in fish aquaculture is control 

10 of fish reproduction. For some species of fish, it is 
difficult to induce the fish to reproduce in captivity. 
Alternatively, it may be difficult to induce fish in captivity 
to reproduce at the time of year that is normal to the 
species. One approach to the control of reproduction in fish 

15 for aquaculture is the development of transgenic fish by 
introduction of DNA encoding the [His 5 , Trp 7 ,Tyr 8 ] -GnRH 
preprohormone and the [Ser 8 ] -GnRH preprohormone. 

Many of the potential uses of GnRH, for example fish 
aquaculture, require identification and isolation of the gene 

20 encoding the precursor protein from which the peptide is 

produced. The sequence of the [His 5 , Trp 7 ,Tyr 8 ] -GnRH and the 
[Ser 8 ] -GnRH preprohormones and the sequence of the genes which 
encode these proteins have not been described in the prior 
art. Identification of the genes encoding these sequences 

25 will facilitate production and use of the decapeptide hormones 
and other biologically active fragments in a variety of 
applications. These and other needs are addressed by the 
present invention. 

30 SUMMARY OF THE INVENTION 

The present invention provides compositions for 
isolated nucleic acids encoding vertebrate [His 5 ,Trp 7 ,Tyr 8 ] - 
GnRH preprohormones and [His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone 
GAP peptides. The invention also provides for isolated 

35 nucleic acids encoding vertebrate [Ser 8 ] -GnRH preprohormones 
and [Ser 8 ] -GnRH preprohormone GAP peptides. In addition, the 
invention also provides for compositions comprising the above 
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preprohormone polypeptides. The preprohormone polypeptides 
may be, for example, recambinantly produced. 

Nucleic acid probes that selectively hybridize to 
nucleic acids encoding the above preprohormone polypeptides 
are also provided. The invention provides methods of 
detecting nucleic acids encoding the preprohormone 
polypeptides by nucleic hybridization assays utilizing these 
probes. Antibodies that are reactive with the preprohormone 
polypeptides are provided, along with immunoassay methods 
based on these antibodies and which detect the preprohormone 
polypeptides. These nucleic acid hybridization assays and 
immunoassays are useful as measurements of reproductive 
function in a variety of different situations. 

The invention further provides for transgenic 
animals created by the introduction of a DNA construct 
encoding a vertebrate [His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone or a 
[Ser 8 ] -GnRH preprohormone. These transgenic animals include 
fish species useful in aquaculture and farm animal species, 
such as chickens and other animals. 

DEFINITIONS 

Abbreviations for the twenty naturally occurring -~ 
amino acids follow conventional usage. In the polypeptide 
notation used herein, the left-hand direction is the amino 
terminal direction and the right-hand direction is the 
carboxy- terminal direction, in accordance with standard usage 
and convention. Similarly, unless specified otherwise, the 
left hand end of single -stranded polynucleotide sequences is 
the 5' end; the left hand direction of double -stranded 
polynucleotide sequences is referred to as the 5 1 direction. 
The direction of 5' to 3' addition of nascent RNA transcripts 
is referred to as the transcription direction; sequence 
regions on the DNA strand having the same sequence as the RNA 
and which cure 5' to the 5' end of the RNA transcript are 
referred to as "upstream sequences"; sequence regions on the 
DNA strand having the same sequence as the SNA and which axe 
3' to the 3 V end of the RNA transcript are referred to as 
"downstream sequences" . 
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The nucleotide codes shown below are used in the 
nucleic acid sequences of the invention. This code has been 
adopted by the IUPAC-IUB Biochemical Nomenclature Commission. 



5 


Code 


Group 


wucieotiuc \o) 




A 


A 


aaenxne 


10 


C 


C 


cytosine 




6 


G 


guanine 




T 


T 


thymine 


15 


U 


U 


uracil 




Y 


C or T(U) 


pyrimidine 


20 


R 


A or 6 


purine 




M 


A or C 


amino 




K 


6 or T(U) 


keto 


25 


S 


G or C 


3 hydrogen bonds 




W 


A or T(U) 


2 hydrogen bonds 


30 


H 


A or C or T(U) 


not-G 




B . 


G or T(U) or C 


not -A 




V 


G or C or A 


not-T(U) 


35 


D 


G or A or T(U) 


not-C 




N 


G,A,C or T(U) 


any 



40 

The term "nucleic acids", as used herein, refers to 
either DNA or RNA. "Nucleic acid sequence" or "polynucleotide 
sequence" refers to a single- or double- stranded polymer of 
deoxyribonucleotide or ribonucleotide bases read from the 5' 
45 to the 3 1 end. It includes both self -replicating plasmids, 
infectious polymers of DNA or RNA and nonfunctional DNA or 
RNA. 

The following terms are used to describe the 
sequence relationships between two or more nucleic acids or 
50 polynucleotides: "reference sequence", "comparison window", 
"sequence identity", "percentage of sequence identity", and 
"substantial identity". A "reference sequence" is a defined 



WO 95/12309 



5 



PCT/DS94/12763 



sequence used as a basis for a sequence comparison; a 
reference sequence may be a subset of a larger sequence, for 
example, as a segment of a full-length cDNA or gene sequence 
given in a sequence listing, such as the nucleic acid sequence 
5 of figure 2, or may comprise a complete cDNA or gene 

sequence. Generally, a reference sequence is at least 20 
nucleotides in length, frequently at least 25 nucleotides in 
length, and often at least 50 nucleotides in length. Since 
two polynucleotides may each (1) comprise a sequence (i.e., a 

10 portion of the complete polynucleotide sequence) that is 

similar between the two polynucleotides, and (2) may further 
comprise a sequence that is divergent between the two 
polynucleotides, sequence comparisons between two (or more) 
polynucleotides are typically performed by comparing sequences 

15 of the two polynucleotides over a "comparison window" to 

identify and compare local regions of sequence similarity* 

A "comparison window", as used herein, refers to a 
conceptual segment of at least 20 contiguous nucleotide 
positions wherein a polynucleotide sequence may be compared to. 

20 a reference sequence of at least 20 contiguous nucleotides and 
wherein the portion of the polynucleotide sequence in the 
comparison window may comprise additions or deletions (i.e., 
gaps) of 20 percent or less as compared to the reference 
sequence (which does not comprise additions or deletions) for 

25 optimal alignment of the two sequences. 

Optimal alignment of sequences for aligning a 
comparison window may be conducted by the local homology 
algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2:482, 
by the homology alignment algorithm of Needleman and Wunsch 

30 (1970) J. Mbl. Biol. 48:443, by the search for similarity 
method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. 
(USA) 85:2444, by computerized implementations of these 
algorithms SAP, BESTFIT, PASTA, and TFASTA in the Wisconsin 
Genetics Sottware Package Release 7.0, Genetics Computer 

35 Group, 575 Science Dr., Madison, WI) , or by inspection, and 

the best alignment (i.e., resulting in the highest percentage 
of sequence similarity over the comparison window) generated 
by the various methods is selected. Preferred programs for 
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this use include GAP, BESTFIT AND FASTA, with standard 
parameters . 

The term "sequence identity" means that two 
polynucleotide sequences are identical (i.e., on a nucleotide- 
5 by-nucleotide basis) over the window of comparison. The term 
■percentage of sequence identity" is calculated by comparing 
two optimally aligned sequences over the window of comparison, 
determining the number of positions at which the identical 
nucleic acid base (e.g., A, T, C, G, U, or I) occurs in both 

10 sequences to yield the number of matched positions, dividing 
the number of matched positions by the total number of 
positions in the window of comparison (i.e., the window size), 
anr» multiplying the result by 100 to yield the percentage of 
sequence identity. 

15 The terms "substantial identity" or "substantial 

sequence identity" as applied to nucleic acid sequences and as 
used herein and denote a characteristic of a polynucleotide 
sequence, wherein the polynucleotide comprises a sequence that 
has at least 85 percent sequence identity, preferably at least 

20 90 to 95 percent sequence identity, more usually at least 99 
percent sequence identity as compared to a reference sequence 
over a comparison window of at least 20 nucleotide positions, 
frequently over a window of at least 25-50 nucleotides, 
wherein the percentage of sequence identity is calculated by 

25 comparing the reference sequence to the polynucleotide 

sequence which may include deletions or additions which total 
20 percent or less of the reference sequence over the window 
of comparison. The reference sequence may be a subset of a 
larger sequence, for example, as a segment of the full-length 

30 [His 5 ,Trp 7 ,Tyr 8 ] -GnRH and [Ser 8 ] -GnRH sequences disclosed 
herein. 

As applied to polypeptides, the terms "substantial 
identity" or "substantial sequence identity" mean that two 
peptide sequences, when optimally aligned, such as by the 
35 programs GAP or BESTFIT using default gap weights, share at 
least 80 percent sequence identity, preferably at least 90 
percent sequence identity, more preferably at least 95 percent 
sequence identity or more (e.g., 99 percent sequence 
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identity) . Preferably, residue positions which are not 
identical differ by conservative amino acid substitutions. 
Conservative amino acid substitutions refer to the 
inter changeability of residues having similar side chains. 
5 For example, a group of amino acids having aliphatic side 

chains is glycine, alanine, valine, leucine, and isoleucine; a 
group of amiso acids having aliphatic -hydroxyl side chains is 
serine and threonine; a group of amino acids having amide - 
containing side chains is asparagine and glut amine; a group of 

10 amino acids having aromatic side chains is phenylalanine, 

tyrosine, and tryptophan; a group of amino acids having basic 
side chains. is lysine, arginine, and histidine; and a group of 
amino acids having sulfur- containing side chains is cysteine 
and methionine. Preferred conservative amino acids 

15 substitution groups are: valine-leucine- isoleucine, 

phenylalanine- tyrosine, lysine -arginine, alanine -valine, and 
asparagine - glut amine . 

The term "isolated" or " substantially pure" means a 
compound is the predominant species present (i.e., on a molar 

20 basis it is more abundant than any other individual species in 
the composition) , and preferably a substantially purifier 
fraction is a composition wherein the object species comprises 
at least about 50 percent (on a molar basis) of all 
macromolecular species present. Generally, a substantially 

25 pure composition will comprise more than about 80 to 90 
percent of all macromolecular species present in the 
composition. Most preferably, the species is purified to 
essential homogeneity (contaminant species cannot be detected 
in the composition by conventional detection methods) wherein 

30 the cong>osition consists essentially of a single 

macromolecular species. For example, "isolated" or 
■substantially pure", when referring to nucleic acids, refers 
to those that have been purified away from other chromosomal 
or ext rachramos amal DNA or RNA by standard techniques, 

35 including alkaline /SDS treatment, CsCl banding, column 

chromatography, and other techniques well known in the art. 
See, F. Ausubel, et al., ed. Current Protocols In Molecular 
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Biology, Greene Publishing and Wiley- Inters cience, New York 
(1987), incorporated herein by reference. 

"Nucleic acid probes" may be DNA or RNA fragments* 
DNA fragments prepared, for example, by digesting plasmid DNA, 
5 or by use of PCR, or synthesized by either the phosphoramidite 
method described by Beaucage and Carruthers, Tetrahedron Lett. 
22:1859-1862 (1981), or by the triester method according to 
Matteucci, eta!., J. Am. Cfaezn. Soc, 103:3185 (1981), both 
incorporated herein by reference. A double stranded fragment 

10 may then be obtained, if desired, by annealing the chemically 
synthesized single strands together under appropriate 
conditions or by synthesizing the complementary strand using 
DNA polymerase with an appropriate primer sequence. Where a 
specific sequence for a nucleic acid probe is given, it is 

15 understood that the complementary strand is also identified 

and included. The complementary strand will work equally well 
in situations where the target is a double -stranded nucleic 
acid. 

The phrase "selectively hybridizing to" refers to a . 

20 nucleic acid probe that hybridizes, duplexes or binds only to 
a particular target DNA or RNA sequence when the target 
sequences are present in a preparation of total cellular DNA 
or RNA. "Complementary" or "target" nucleic acid sequences 
refer to those nucleic acid sequences which selectively 

25 hybridize to a nucleic acid probe. Proper annealing 

conditions depend, for example, upon a probe's length, base 
composition, the number of mismatches and their position 
on the probe, must often be determined empirically. For 
discussions of nucleic acid probe design and annealing 

30 conditions, see, for example, Sambrook et a!.. Molecular 

Cloning: A Laboratory Manual (2nd ed.). Vols. 1-3, Cold Spring 
Harbor Laboratory, (1989) or Current Protocols in Molecular 
Biology, P. Ausubel et al., ed. Greene Publishing and Wiley - 
Interscience, New York (1987) . 

35 The phrase "nucleic acid sequence encoding" refers 

to a nucleic acid which directs the expression of a specific 
protein or peptide. The nucleic acid sequences include both 
the DNA strand sequence that is transcribed into RNA and the 
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RNA. sequence that is translated into protein. The nucleic 
acid sequences include both the full length nucleic acid 
sequences as well as non-full length sequences derived from 
the full length protein. It being further understood that the 
5 sequence includes the degenerate codons of the native sequence 
or sequences which may be introduced to provide codon 
preference in a specific host cell. 

The phrase "expression cassette", refers to 
nucleotide sequences which are capable of affecting expression 

10 of a structural gene in hosts compatible with such sequences. 
Such cassettes include at least promoters and optionally, 
transcription termination signals. Additional factors 
necessary or helpful in effecting expression may also be used 
as described herein. 

15 The term "operably linked" as used herein refers to 

linkage of a promoter upstream from a DNA sequence such that 
the promoter mediates transcription of the DNA sequence. 

The term "vector", refers to viral expression 
systems, autonomous self -replicating circular DNA (plasmids) , 

20 and includes both expression and nonexpression plasmids. 

Where a recombinant microorganism or cell culture is described 
as hosting an "expression vector, " this includes both 
extrachramosamal circular DNA and DNA that has been 
incorporated into the host chromosome (s) . Where a vector is 

25 being maintained by a host cell, the vector may either be 

stably replicated by the cells during mitosis as an autonomous 
structure, or is incorporated within the host's genome. 

The term "plasmid" refers to an autonomous circular 
DNA molecule capable of replication in a cell, and includes 

30 both the expression and nonexpression types. Where a 

recombinant microorganism or cell cult re is described as 
hosting an "expression plasmid", this includes both 
extrachromosomal circular DNA molecules and DNA that has been 
incorporated into the host chromosome (s) . Where a plasmid is 

35 being maintained by a host cell, the plasmid is either being 

stably replicated by the cells during mitosis a. an autonomous 
structure or is incorporated within the host's genome. 
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The phrase "recombinant protein" or " recombinant ly 
produced protein" refers to a peptide or protein produced 
using non-native cells that do not have an endogenous copy of 
DNA able to express the protein. The cells produce the 
5 protein because they have been genetically altered by the 
introduction of the appropriate nucleic acid sequence. The 
recombinant protein will not be found in association with 
proteins and other subcellular components normally associated 
with the cells producing the protein. 

10 "Specifically immunoreactive n refers to a binding 

reaction between an antibody and antigen which is 
determinative of the presence of the antigen in the presence 
of a heterogeneous population of proteins and other biological 
macromolecules or in a biological sample. Thus, under 

15 designated immunoassay conditions, the specified antibodies 

bind to a particular protein and do not bind in a significant 
amount to other proteins present in the sample. Specific 
binding to an antibody under such conditions may require an 
antibody that is selected for its specificity for a particular 

20 protein. For example, antibodies raised to the H. Jburtoni 
[His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone with the amino acid 
sequence depicted in Seq. ID No. 2 or to the treeshrew 
[His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone sequence depicted in Seq. 
ID No. 18 can be selected to obtain antibodies specifically 

25 immunoreactive with [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone 

proteins and not to other proteins. Homologous proteins to 
the H. burtoni [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone and the 
treeshrew [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone encompass 
[His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone species, but do not 

30 include other proteins. Similarly, antibodies raised to the 
H. burtonl [Ser 8 ] -GnRH preprohonnone with the amino acid 
sequence depicted in Seq. ID No. 20 can be selected to obtain 
antibodies specifically immunoreactive with [Ser 8 ] -GnRH . 
preprohonnone proteins and not to other proteins. Homologous 

35 proteins to the H. burtoni [Ser 8 ] -GnRH preprohonnone encompass 
[Ser 8 ] -GnRH preprohonnone species, but do not include other 
proteins . 
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A variety of immunoassay formats may be used to 
select antibodies specifically immunoreactive with a 
particular protein. For example, solid-phase ELISA 
immunoassays are routinely used to select monoclonal 
antibodies specif ically immunoreactive with a protein. See 
Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold 
Spring Harbor Publications, New York, for a description of 
immunoassay formats and conditions that can be used to 
determine specific immunoreactivity. 

"Biological sample" as used herein refers to any 
sample obtained from a living organism or from an organism 
that has died. Examples of biological samples include body 
fluids and tirsue specimens * 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1A is a comparison of known 6nRH amino acid 
sequences. The primary structures of the eight known GnRH 
forms and of the first n i ne amino- terminal residues from yeast 
oe-mating factor are shown. 

Fig. IB shows the predicted amino acid sequence of 
the preprohormone for H. burtoni [Trp 7 , Leu 8 ] -GnRH as compared 
with the [His 5 , Trp 7 , Tyr*] -GnRH peptide. The two residues at 
which these forms differ are indicated in bold print. 
Degenerate 5 V oligonucleotides used for PCR are shown below. 

Fig. 2 shows the cDNA and predicted amino acid 
sequence of H. burtoni prepro [His 5 , Trp 7 , Tyr 8 ] -GnRH. The 
functional domains of prepro [His 5 , Trp 7 , Tyr 8 ] -GnRH are 
illustrated and compared to H. burtoni prepro [Trp 7 , Leu 8 ] -GnRH . 
Amino acids are numbered beginning at the first residue in the 
GnRH decapeptide . The functional domains of both 
preprohormones are illustrated. Hydrophobic signal sequences 
(negative numbers) are directly followed by the GnRH 
decapeptide region (bold) and residues involved with 
posttranslational processing. The GAP peptide sequences 
follow the decapeptide and posttranslational processing 
sequences. A second processing site (underlined) indicates 
coding sequences for two novel peptides in the 
[His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone. 
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Pig. 3 shows the results of high stringency Northern 
blot analysis using total RNA from ventral, dorsal or whole 
brain, hybridized to 32 P-labelled prepro [His 5 , Trp 7 , Tyr 8 ] -GnRH 
riboprobe. Lanes 1, 2, and 3 were loaded with 5, 10 and 20 
5 fig, respectively, of total RNA extracted from a pool of 3 
brains. Lanes 4 through 7 contain 10 fig of total RNA each 
from one half of a single brain. Lanes 4 and 6 contain 
ventral brain RNA while lanes 5 and 7 contain RNA from the 
corresponding dorsal halves. One band of approximately 530 

10 bases is seen in lanes containing ventral or total brain RNA. 
No signal was detected in total RNA extracted from dorsal 
brain halves, localizing the expression of this transcript 
exclusively to more ventral brain regions. Methylene blue 
staining of ribosomal bands confirmed that the equal amounts 

15 of total RNA were loaded into lanes 4-7 of the gel. 

Fig. 4 shows localization of prepro 
[His 5 , Trp 7 , Tyr 8 ] -GnRH mRNA in H. burtoni brain. TOP: 
schematic view of a mid- sagittal section from H. burtoni brain 
showing the three populations of neurons immunoreactive for 

20 GnRH. For simplicity, the clusters are represented within the 
same plane although the mesencephalic population lies lateral 
to both the terminal nerve and POA populations. The terminal 
nerve GnRH population is indicated by is indicated by the 

POA population by *, and the mesencephalic population by #. 

25 The bottom six panels show the labelling with each of the two 
GnRH gene probes in the three brain regions with cells known 
to contain GnRH from immunocytochemis try . LEFT COLUMN: in 
situ hybridization with a prepro [His 5 , Trp 7 , Tyr 8 ] -GnRH 
digoxygenin - "DTP riboprobe. Only the mesencephalic population 

30 was labeled. Scale bar = 100 pm. RIGHT COLUMN: in situ 

hybridization with a prepro [Trp 7 , Leu 8 ] -GnRH digoxygenin-UTP 
riboprobe. Only the terminal nerve population was labeled. 
Scale bar » 100 jmu Note that neither probe labels the 
POA/hypothalamic group of cells. 

35 Fig. 5 shows localization of prepro [Ser 8 ] -GnRH mRNA 

in H. burtoni brain. Midsagittal sections are shown in the 
panels. The top two panels show hybridization with a prepro 
[Ser 8 ] -GnRH riboprobe in subordinate and dominant male fish. 
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The bottom two panels show hybridization of the riboprobe in 
the terminal nerve area and the mesencephalon regions of the 
H. burton! brain of dominant males. 

5 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

This invention provides isolated nucleic acid 
sequences encoding precursor proteins for the chicken II form 
of GnRH, which is referred to here as the [His 5 ,Trp 7 f Tyr 8 ] - 
GnRH preprohormone, and for the [Ser 8 ] -GnRH preprohormone . 

10 The sequences can be used in a number of applications. For 
instance, they can be used for the recombinant production of 
the preprohormone polypeptides. In addition, transgenic 
animals (e.g., fish and chickens, and non-human mammals) can 
be generated using the sequences provided here. Diagnostic 

15 assays for detecting nucleic acids encoding the 

[His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone or the [Ser 8 ] -GnRH 
preprohormone, as well as assays for these preprohormone 
proteins are also provided. These assays are particularly 
useful in the diagnosis of reproductive diseases or 

20 reproductive capacity in animals. Compositions and methods 
for using the sequences provided here are described in more 
detail below. 

A) <?nRH pireprohormpne polypeptides 

25 As described above, there are eight different forms 

of GnRH that have been isolated and characterized from 
vertebrate species (see figure 1) . A precursor for an 
additional form of GnRH, [Ser 8 ] -GnRH, is also described 
herein. Figure 1 shows all decapeptide sequences except 

30 [Ser 8 ] -GnRH with reference to the macimalian GnRH sequence. 

[Ser 8 ] -GnRH is a newly discovered form of GnRH, so that there 
are now 9 known forms of GnRH. Residues which differ from 
those in the mammalian form are identified both by position 
?iT n r i residue. Thus, teleost GnRH is referred to as 

35 [Trp 7 , Leu 8 ] -GnRH because, in teleosts, the Leu in position 7 

is Trp and the Arg in position B is Leu when compared with the 
mammalian GnRH sequence. As can be seen in GnRH, positions 1, 
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2, 4, 9, and 10 are invariant, and positions 3 and 7 show only 
conservative changes. 

The predicted amino acid sequence for the 
[His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone from the teleost fish, H. 
5 burtoni, is shown in Figure 2 (Seq. ID No. 2). In addition, 
the predicted amino acid sequence for the treeshrew 
[His 5 , Trp 7 , Tyx 8 ] -GnRH preprohormone is shown in Seq ID No. IB. 
The precursor of a additional form of GnRH, [Ser 8 ] -GnRH, is 
shown in Seq. ID No. 20. The GnRH preprohormones have a 

10 characteristic structure consisting of three separate domains. 
The N- terminal domain consists of a signal peptide region 
which is a common feature of many secretory proteins. The 
second domain consists of the [His 5 , Trp 7 , Tyr 8 ] -GnRH 
decapeptide followed immediately by a 3 amino acid amidation 

15 and precursor processing site. The third, C- terminal, domain 
consists of a GnRH associated peptide or GAP peptide. The GAP 
peptide region of the [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone may 
generate two peptides because of an additional potential 
proteolytic processing site in this region of the 

20 preprohormone. Thus, a single [His 5 , Trp 7 , Tyr 8 ] -GnRH 

preprohormone GAP peptide consisting of the entire sequence of 
this region of the protein may be produced. Alternatively, 
there may be two [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone GAP 
peptides present. 

25 The [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone and the 

[Ser 8 ] -GnRH preprohormone represent two members of a family of 
GnRH preprohormones . Homology between the H. burton i and the 
treeshrew [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormones and the H. 
burtoni [Ser 8 ] -GnRH preprohormone was determined by the PASTA 

30 anH GAP computer programs (version 7.3 Unix, Genetics Computer 
Group, 575 Science Drive, Madison, Wisconsin, USA) . Using 
this procedure there is a 59% homology, and a 44% amino acid 
identity between the ami no acid sequences of H. burtoni and 
treeshrew [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormones. Also using 

35 this procedure, there is a 55% homology and a 33% identity 
between the amino acid sequences of the H. burtoni 
[His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone and the H. burtoni 
[Ser 8 ] -GnRH preprohormone. 
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The terms n [His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone n , 
" [His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone polypeptides" or "chicken 
II GnRH preprohormone" as used herein refer to all polypeptide 
precursor forms of [His 5 , Trp 7 ,Tyr 8 ] -GnRH, but exclude the 
5 mature [His 5 ,Trp 7 ,Tyr 8 ] -GnRH decapeptide. Similarly, the 
terms " [Ser 8 ] -GnRH preprohormone" or " [Ser 8 ] -GnRH 
preprohormone polypeptides" as used herein refer to all 
polypeptide precursor forms of [Ser 8 ] -GnRH, but exclude the 
mature [Ser 8 ] -GnRH decapeptide. These terms also refer to 

10 biologically active fragments of the precursor proteins for 
[His 5 , Trp 7 ,Tyr 8 ] -GnRH and [Ser 8 ] -GnRH respectively. 
Significant biological activities include GnRH activity, 
latent GnRH activity and immunological activity* Latent GnRH 
activity that the biologically active fragment can be 

15 proteolytically processed to yield active GnRH. 

Immunological activity refers to immunoreactivity 
with an antibody raised to a full-length [His 5 , Trp 7 ,Tyr 8 ] -GnRH 
or [Ser 8 ] -GnRH preprohormone or to segments of these 
preprohormones . A segment of a [His 5 , Trp 7 ,Tyr 8 ] -GnRH 

20 preprohormone or [Ser 8 ] -GnRH preprohormone will ordinarily 

comprise at least about 5 contiguous amino acids, typically at 
least about 7 contiguous amino acids, more typically at least 
about 9 contiguous amino acids, usually at least about 11 
contiguous amino acids, preferably at least about 13 

25 contiguous amino acids, more preferably at least about 16 

contiguous amino acids, and most preferably at least about 20 
to 30 or more contiguous amino acids from the preprohormone. 
Segments of a particular domain will be segments of the 
appropriate size within the corresponding domain. 

30 Biologically active fragments include the [His 5 ,Trp 7 ,Tyr 8 ] - 
GnRH preprohormone ***** [Ser 8 ] -GnRH preprohormone GAP peptide 
or peptides. 

The terms "GnRH related peptide", "GAP" or "GAP 
peptide" refer to the peptide of the C- terminal domain of a 
35 GnRH preprohormone, as described above for the 

[His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone and the [Ser 8 ] -GnRH 
preprohormone. Previously known GAP peptides are further 
described in Sherwood, N.A., et al., supra. The terms 
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n [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone GAP peptide" or "chicken 
II GnRH preprohonnone GAP peptide" refer to the GAP peptide or 
peptides derived from the [His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohonnone. 
The term ■ [Ser 8 ] -GnRH preprohonnone GAP peptide" refers to the 
5 GAP peptide or peptides derived from the [Ser 8 ] -GnRH 

preprohonnone. The [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone and 
[Ser 8 ] -GnRH preprohonnone GAP peptides are produced by the 
proteolytic processing of [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone 
and [Ser 8 ] -GnRH preprohonnone, respectively. The 

10 [His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohonnone GAP peptide is a 

biologically active [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone 
fragment, as defined above, and is therefore included in the 
definition of [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone 
polypeptides. Similarly, the [Ser 8 ] -GnRH preprohonnone GAP 

15 peptide is a biologically active [Ser 8 ] -GnRH preprohonnone 

fragment, as defined above, and is therefore included in the 
definition of [Ser 8 ] -GnRH preprohonnone polypeptides. 

The above -defined terms refer not only to the 
protein having the amino acid sequence disclosed here, but 

20 also to other proteins that are allelic, nonallelic or species 
variants of the H. burton! and treeshrew [His 5 ,Trp 7 , Tyr 8 ] -GnRH 
preprohonnones and the H. Burtoni [Ser 8 ] -GnRH preprohonnone, 
as well as to natural or induced mutant forms of these 
proteins . For example , [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone 

25 polypeptides generally show substantial sequence identity 

(determined as described above) to the amino acid sequence of 
the H. Jburtoni [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone shown in 
figure 2 or to the amino acid sequence of the treeshrew 
[His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone shown in Seq. ID No. 18. 

30 For example, fish [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone 

polypeptides generally show substantial sequence identity to 
the amino acid sequence of the H. Jburtoni [His 5 , Trp 7 , Tyr 8 ] - 
GnRH preprohonnone shown in figure 2, and m a mm alian 
[His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone polypeptides generally 

35 show substantial sequence identity (determined as described 
above) to the amino acid sequence of the treeshrew 
[His 5 , Trp 7 , Tyr 8 ] -GnRH preprohonnone shown in Seq. ID No. 18. 
Similarly, [Ser 8 ] -GnRH preprohonnone polypeptides generally 
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show substantial sequence identity to the amino acid sequence 
of the H. burtoni [Ser 8 ] -GnRH preprohormone shown in Seq ID 
No* 20. 

[His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone polypeptides 
5 will also typically be specifically iir *ioreactive with 

antibodies raised against the H. burtc. [His 5 ,Trp 7 ,Tyr 8 ] -GnRH 
preprohormone of figure 2 or against the treeshrew 
[His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone shown in Seq. ID No. 
[???]. In particular, fish [His 5 , Trp 7 ,Tyr 8 ] -GnRH 
10 preprohormone polypeptides will typically be specifically 

immunoreactive with antibodies raised against the H. burtoni 
[His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone of figure 2, and mammalian 
[His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone polypeptides will 
typically be specifically immunoreactive with antibodies 
15 raised against the treeshrew [His 5 , Trp 7 ,Tyr 8 ] -GnRH 

preprohormone shown in Seq. ID No. 18. Similarly, [Ser 8 ] - 
GnRH preprohormone polypeptides will generally be specifically 
immunoreactive with antibodies raised against the H. burtoni 
[Ser 8 ] -GnRH preprohormone shown in Seq. ID No. 20, 
20 A [His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone that 

specifically binds to or that is specifically immunoreactive 
to an antibody generated against a defined immunogen, such as 
an immunogen consisting of the amino acid sequence of Seq. ID 
No. 2 or Seq. ID No. 18 is determined in an immunoassay. The 
25 immunoassay uses a polyclonal antiserum which was raised to 
the protein of Seq. ID No. 2 or Seq. ID No. 18- This 
antiserum is selected to have low crossreactivity against GnRH 
preprohormones other than [His 5 ,Trp 7 ,Tyr 8 ] -GnRH 
preprohormones . Crossreactivity to these other forms of GnRH 
30 preprohormone is removed by immur, ^absorption prior to use in 
the immunoassay. 

In order to produce antisera to [His 5 ,Trp 7 ,Ty r8 ] - 
GnRH precursor for use in an immunoassay, the proteins of Sec; 
ID No. 2 and Seq. ID No. 18 are isolated as describ a herei* 
35 For example, recombinant protein is produced in a mammal iar 
cell line. An inbred strain of mice such as balb/c or rab; 
are immunized with the protein of Seq. ID No. 2 or Seq. H 
No. 18 using a standard adjuvant, such as Freund's adjuvant. 
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anH a standard mouse immunization protocol (see Harlow and 
Lane, supra) . Polyclonal sera are collected and titered 
against the immunogen protein in an immunoassay, for example, 
a solid phase immunoassay with the immunogen immobilized on a 
5 solid support. Polyclonal antisera with a titer of 10 4 or 
^greater are selected and tested for their cross reactivity 
against GnRH preprohormones other than [His 5 ,Trp 7 ,Tyr 8 ] -GnRH 
preprohormone , using a competitive binding immunoassay such as 
the one described in Harlow and Lane, supra, at pages 570-573. 

10 Three non- [His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormones are used in 

this determination: the mammalian GnRH preprohormone obtained 
from human, rat, and mouse (see Sherwood, et al. (1993), 
supra) ; the [Trp 7 , Leu 8 ] -GnRH preprohormone obtained from 
Salmon and H. burtont, and the [Ser 8 ] -GnRH preprohormone 

15 obtained from H. Jburtoni. These non- [His 5 , Trp 7 , Tyr 8 ] -GnRH 
preprohormones can be produced as recombi n a n t proteins and 
isolated using standard molecular biology and protein 
chemistry techniques as described herein. 

Immunoassays in the competitive binding format can 

20 be used for the crossreactivity determinations. For example, 
the protein of Seq. ID No. 2 or Seq. ID No. 18 can be 
immobilized to a solid support. Proteins added to the assay 
compete with the binding of the antisera to the immobilized 
antigen. The ability of the above proteins to compete with 

25 the binding of the antisera to the immobilized protein is 

compared to the protein of Seq. ID No. 2 or Seq. ID No. 18. 
The percent crossreactivity for the above proteins is 
calculated, using standard calculations. Those antisera with 
less t hffn 10% crossreactivity with each of the proteins listed 

30 above are selected pooled. Antisera raised against the 

protein of Seq ID No. 2 and the protein of Seq. ID No. 18 are 
pooled separately. The cross -reacting antibodies are then 
removed from the pooled antisera by immunoabsorption with the 
above-listed proteins. 

35 The immunoabs orbed and pooled antisera cure then used 

in a competitive binding immunoassay as described above to 
compare a second protein to the immunogen protein. In order 
to make this comparison, the two proteins are each assayed at 
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a wide range of concentrations and the amount of each protein 
required to inhibit 50% of the binding of the antisera to the 
immobilized protein is determined. For example, if the amount 
of the second protein required is less than 10 times the 
amount of the protein of Seq. ID No. 2 required, then the 
second protein is said to specifically bind to an antibody 
generated to an immunogen consisting of the protein of Seq. ID 
No. 2. 

A [Ser 8 ] -GnRH preprohormone that specifically binds 
to or that is specifically immunoreactive to an antibody 
generated against a defined immunogen, such as an immunogen 
consisting of the amino acid sequence of Seq. ID No. 20 can 
also determined in an immunoassay, by using a procedure 
similar to that described above for [His 5 , Trp 7 ,Tyr 8 ] -GnRH 
preprohormones . In this case, however, polyclonal antisera 
are selected and tested for their cross reactivity against 
GnRH preprohormones other than [Ser 8 ] -GnRH preprohormone. The 
non- [Ser 8 ] -GnRH preprohormones used in this deter m i n ation are: 
the mammalian GnRH preprohormone obtained from hu m a n , ra~, and 
mouse (see Sherwood, et al. (1993), supra); the [Trp 7 ,Lea 8 ] - 
GnRH preprohormone obtained from Salmon and H. burton! , and 
the [His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone obtained from H. 
burton! and the treeshrew. 

B- Nucleic acid co mpositions for rHis 5 .Trp 7 .Tvx 8 l -GnRH and 
fSer 8 1 -GnR H preprohormone polypeptide? 

This invention relates to isolated nucleic acid 
sequences encoding [His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone and 
[Ser 8 ] -GnRH preprohormone polypeptides. The nucleic acid 
compositions of this invention, whether RNA, cDNA, genomic 
DNA, or a hybrid of the various combinations, may be isolated 
from natural sources or may be synthesized in vitro. The 
nucleic acids claimed may be present in transformed or 
transfected whole cells, in a transformed or transfected cell 
lysate, or in a partially purified or substantially pure form. 

The nucleic acid sequences of the invention are 
typically identical to or show substantial sequence identity 
(determined as described above) to the nucleic acid sequence 
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of SEQ ID. No. 1, Seq. ID No. 17, or Seq. ID No. 19. For 
example, nucleic acids encoding [His 5 ,Trp 7 ,Tyr 8 ] -GnRH 
preprohorxnone polypeptides and which show substantial sequence 
identity to Seq. ID No. 1 will typically hybridize to the 
5 nucleic acid sequence of Seq. ID No. 1 under stringent 
conditions. Similarly, nucleic acids encoding 
[His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone polypeptides and which 
show substantial sequence identity to Seq. ID No. 17 will 
typically hybridize to the nucleic acid sequence of Seq. ID 

10 No. 17 under stringent conditions. With regard to [Ser 8 ] -GnRH 
preprohormones, nucleic acids encoding [Ser 8 ] -GnRH 
preprohormone polypeptides and which show substantial sequence 
identity to Seq. ID No. 19 will typically hybridize to the 
nucleic acid sequence of Seq. ID No. 19 under stringent 

15 conditions. Stringent conditions are sequence dependent and 
will be different in different circumstances. Generally, 
stringent conditions are selected to be about 5° C lower than 
the thermal melting point (Tm) for the specific sequence at a 
defined ionic strength and pH. The Tm is the temperature 

20 (under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched probe. 
Typically, stringent conditions will be those in which the 
salt concentration is at least about 0.02 molar at pH 7 and 
the temperature is at least about 60 °C. As other factors may 

25 significantly affect the stringency of hybridization, 

including, among others, base composition and size of the 
complementary strands, the presence of organic solvents and 
the extent of base mismatching, the combination of parameters 
is more important than the absolute measure of any one. 

30 Techniques for nucleic acid manipulation of genes 

encoding these polypeptides such as sub cloning nucleic acid 
sequences encoding polypeptides into expression vectors, 
labelling probes, DNA hybridization, and the like are 
described generally in Sambrook, et al., Molecular Cloning - A 

35 Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York, 1989, which is 
incorporated herein by reference. This manual is hereinafter 
referred to as "Sambrook, et al." 
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There are various methods of isolating the DNA 
sequences encoding [His 5 ,Trp 7 , Tyr 8 ] -GnRH preprohormone and 
[Ser 8 ]-6nRH preprohormone polypeptides. For example, the DNA 
is isolated from a genomic or cDNA library using labelled 
5 oligonucleotide probes having sequences complementary to the 
sequences disclosed here. Restriction endonuclease digestion 
of genomic DNA or cDNA containing the [His 5 , Trp 7 , Tyr 8 ] -GnRH 
preprohormone gene or the [Ser 8 ] -GnRH preprohormone gene can 
be used to isolate nucleic acids encoding these proteins. 

10 Since the DNA sequences encoding [His 5 , Trp 7 , Tyr 8 ] -GnRH and 
[Ser 8 ] -GnRH preprohormones are provided here, a panel of 
restriction endonucleases can be constructed to give cleavage 
of the DNA in the desired regions. After restriction 
endonuclease digestion, DNA encoding [His 5 , Trp 7 , Tyr 8 ] -GnRH and 

15 [Ser 8 ] -GnRH preprohormones is identified by its ability to 
hybridize with nucleic acid probes, for example on Southern 
blots, and these DNA regions cure isolated by standard methods 
familiar to those of skill in the art. See Sambrook, et al. 

Various methods of amplifying target sequences, such 

20 as the polymerase chain reaction can also be used to prepare 
[His 5 , Trp 7 , Tyr 8 ] -GnRH or [Ser 8 ] -GnRH preprohormone DNA. 
Polymerase chain reaction technology (PCR) is used to amplify 
nucleic acid sequences of the [His 5 , Trp 7 , Tyr 8 ] -GnRH or [Ser 8 ] - 
GnRH preprohormone polypeptides directly from mRNA, from CDNA, 

25 and from genomic libraries or cDNA libraries. The isolated 
sequences encoding [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone or 
[Ser 8 ] -GnRH preprohormone may also be used as templates for 
PCR amplification . 

Appropriate primers and probes for amplifying 

30 nucleic acids encoding nucleic acids encoding the 

[His 5 , Trp 7 , Tyr 8 ] -GnRH or the [Ser 8 ] -GnRH preprohormone 
polypeptides can be generated from analysis of the DNA 
sequences. In brief, oligonucleotide primers complementary to 
the two 3' borders of the DNA region to be amplified are 

35 synthesized. The polymerase chain reaction is then carried 
out using the two primers. See PCR Protocols; A Guide to 
Methods and Applications. (Innis, M, Gelfand, D., Sninsjcy, J. 
and White, T., eds.). Academic Press, San Diego (1990). 
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Primers can be selected to amplify the entire regions encoding 
the full-length [His 5 ,Trp 7 ,Tyr 8 ] -GnRH or [Ser 8 ] -GnRH 
preprohormones or to amplify smaller DNA segments of these 
preprohormones as desired. 
5 Oligonucleotides for use as probes are chemically 

synthesized according to the solid phase phosphoramidite 
triester method first described by Beaucage, S.L. and 
Carruthers, M.H., 1981, Tetrahedron Letts. f 22 (20) :1859-1862 
using em automated synthesizer, as described in 

10 Needham- VanDevanter , D.R., et al. , 1984, Nucleic Acids Res*, 
12:6159-6168. Purification of oligonucleotide is by either 
native acrylamide gel electrophoresis or by anion- exchange 
HPLC as described in Pearson, J.D. and Regnier, F.E., 1983, J. 
Chrom. , 255:137-149. 

15 The sequence of the synthetic oligonucleotide can be 

verified using the chemical degradation method of Maxam, A.M. 
and Gilbert, 1980, in W. , Grossman, L. and Moldave, D. f eds. 
Academic Press, New York, Methods in Enzymology, 65:499-560. 

Other methods known to those of skill in the art may. 

20 also be used to produce and isolate nucleic acids encoding 
[His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone or [Ser 8 ] -GnRH 
preprohormone . See Sambrook, et al. for a description of 
other techniques for the isolation of DNA encoding specific 
protein molecules. 

25 

C. Expression of fHis 5 » Tra 7 , Tvr 8 ] -GnRH preprohormone and 
TSer 8 1-GnRH preprohormone. 

Once the DNA encoding [His 5 ,Trp 7 ,Tyr 8 ] -GnRH 
preprohormone or the [Ser 8 ] -GnRH preprohormone is isolated and 
30 cloned, one may express these preprohormones in a variety of 
recombinantly engineered cells. It is expected that those of 
skill in the art are knowledgeable in the numerous expression 
systems available for expression of the DNA encoding these 
preprohormones. No attempt to describe in detail the various 
35 methods known for the expression of proteins in prokaryotes or 
eukaryotes is made here. 

In brief summary, the expression of natural or 
synthetic nucleic acids encoding [His 5 ,Trp 7 ,Tyr 8 ] -GnRH or 
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[Ser 8 ] -GnRH preprohormone will typically be achieved by 
operably linking the DNA or cDNA to a promoter (which is 
either constitutive or inducible) , followed by incorporation 
into an expression vector. The vectors can be suitable for 
5 replication and integration in either prokaryotes or 

eukaryotes. Typical expression vectors contain transcription 
and translation terminators, initiation. sequences, and 
promoters useful for regulation of the expression of the 
polynucleotide sequence encoding [His s; rrp 7 ,Tyr 8 ] -GnRH or 

10 [Ser 8 ] -GnRH preprohormone polypeptides. To obtain high level 
expression of a cloned gene, such as those polynucleotide 
sequences encoding [His 5 , Trp 7 ,Tyr 8 ] -GnRH or [Ser 8 ] -GnRH 
preprohormone, it is desirable to construct expression 
plasmids which contain, at the minimum, a strong promoter to 

15 direct transcription, a ribosome binding site for 

translational initiation, and a transcription/ translation 
terminator. The expression vectors may also comprise generic 
expression cassettes containing at least one independent 
terminator sequence, sequences permitting replication of the 

20 plasmid in both eukaryotes and prokaryotes, i.e., shuttle 
vectors, and selection markers for both prokaryotic and 
eukaryotic systems. See Sambrook et al. Examples of 
expression of [His 5 , Trp 7 ,Tyr 8 ] -GnRH and [Ser 8 ] -GnRH 
preprohormone polypeptides in both prokaryotic and eukaryotic 

25 systems are described below. 

1. Expression in Prokaryotes 

A variety of procaryotlc expression systems may be 
used to express [His 5 , Trp 7 ,Tyx 8 ] -GnRH preprohormone or [Ser 8 ] - 

30 GnRH preprohormone polypeptides. Examples include E. coll, 
Bacillus, Streptoxnycea , and the like. For example, 
[His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone or [Ser 8 ] -GnRH 
preprohormone polypeptides may be expressed in E. coli. As 
another example, these preprohormone polypeptides may be 

35 expressed in fish cells. Other fish proteins such as rainbow 
trout growth hormone have been successfully expressed in E. 
coll {See, e.g., Agellon et al. (1986) Dm 5:463-477 and U.S. 
Patent No, 4,849,359). 
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It is essential to construct expression plasmids 
which contain, at the minimum, a strong promoter to direct 
transcription, a ribosome binding site for translational 
initiation, and a transcription/ translation terminator. 
5 Examples of regulatory regions suitable for this purpose in E. 
coli are the promoter and operator region of the E. coli 
tryptophan biosynthetic pathway as described by Yanofsky, C, 
1984, J. Bacteriol., 158:1018-1024 and the leftward promoter 
of phage lambda (P L ) as described by Herskowitz, I. and Hagen, 

10 D., 1980, Ann* Rev. Genet., 14:399-445. The inclusion of 
selection markers in UNA vectors transformed in E. coli is 
also useful. Examples of such markers include genes 
specifying resistance to ampicillin, tetracycline, or 
chloramphenicol . See Sambrook et al. for details concerning 

15 selection markers for use in E. coli. 

[His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone and [Ser 8 ] -GnRH 
preprohormone polypeptides produced by prokaryotic cells may 
not necessarily fold properly. During purification from E. 
coli, the expressed protein may first be denatured and then 

20 renatured. This can be accon^lished by solubilizing the 

bacterially produced proteins in a chaotropic agent such as 
guanidine HC1 reducing all the cysteine residues with a 
reducing agent such as beta-mercaptoethanol. The protein is 
then renatured, either by slow dialysis or by gel filtration. 

25 See U.S. Patent No. 4, 511,503. 

Detection of the expressed antigen is achieved by 
fflPt-brvjp known in the art as radioimmunoassays, or Western 
blotting techniques or immunoprecipitation. Purification from 
E. coli can be achieved following procedures described in U.S. 

30 Patent No. 4,511,503. 

2, Expre ssion in Bukarvotes 

A variety of eukaryotic expression systems such as 
yeast, insect cell lines, bird, fish, and mammalian cells, are 
35 known to those of skill in the art. As explained briefly 
below, [His 5 , Trp 7 , Tyr 8 ] - GnRH preprohormone or [Ser 8 ] -GnRH 
preprohormone polypeptides may also be expressed in these 
eukaryotic systems. 
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Synthesis of heterologous proteins in yeast is well 
known and described. Methods in Yeast Genetics, Sherman, F., 
et ai., Cold Spring Harbor Laboratory, (1982) is a well 
recognized work describing the various methods available to 
5 produce the preprohormone in yeast. 

Suitable vectors usually have expression control 
sequences, such as promoters, including 3-phosphoglycerate 
kinase or other glycolytic enzymes, and an origin of 
replication, termination sequences and the like as desired. 

10 For instance, suitable vectors are described in the literature 
(Botstein, et ai., 1979, Gene, 8:17-24; Broach, et al. f 1979, 
Gene, 8:121-133). 

Two procedures are used in transforming yeast cells. 
In one case, yeast cells are first converted into protoplasts 

15 using zymolyase, lyticase or glusulase, followed by addition 
of DNA and polyethylene glycol (PEG) . The PEG-treated 
protoplasts are then regenerated in a 31 agar medium under 
selective conditions. Details of this procedure are given in 
the papers by J.D. Beggs, 1978, tfature (London), 275:104-109; 

20 and Hinnen, A., et al. f 1978, Proc. Natl. Acad. Sci. USA, 

75:1929-1933. The second procedure does not involve removal 
of the cell wall* Instead the cells are treated with lithium 
chloride or acetate and PEG and put on selective plates (Ito, 
H., et al. # 1983, «T. Bact., 153:163-168). 

25 [His 5 , Trp 7 , Tyr 8 ] - GnRH preprohormone or [Ser 8 ] - GnRH 

preprohormone polypeptides, once expressed, can be isolated 
from yeast by lysing the cells and applying standard protein 
isolation techniques to the lysates. The monitoring of the 
purification process can be accomplished by using Western blot 

30 techniques or radioimmunoassays of other standard immunoassay 
techniques . 

The sequences encoding the [His 5 , Trp 7 , Tyr 8 ] -GnRH 
preprohormone or [Ser 8 ] -GnRH preprohormone polypeptides can 
also be ligated to various expression vectors for use in 
35 transforming cell cultures of, for instance, mammalian, 

insect, bird or fish origin. Illustrative of cell cultures 
useful for the production of the polypeptides are mammalian 
cells. Mammal iam cell systems often will be in the form of 
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monolayers of cells although mammalian cell suspensions may 
also be used. A number of suitable host cell lines capable of 
secreting intact proteins have been developed in the art, and 
include the CHO cell lines, various human cells such as COS 
5 cell lines, HeLa cells, myeloma cell lines, Jurkat cells, etc. 
Expression vectors for these cells can include expression 
control sequences, such as an origin of replication, a 
promoter (e.g., a HSV tic promoter or pgk (phosphoglycerate 
kinase) promoter), an enhancer (Queen et al. (1986) Ixnmunol. 

10 Rev. 89:49), and necessary processing information sites, such 
as ribosame binding sites, RNA splice sites, polyadenylation 
sites (e.g., an SV40 large T Ag poly A addition site) , and 
transcriptional terminator sequences. 

Other animal cells useful for production of the 

15 preprohormone are available, for instance, from the American 

Type Culture Collection Catalogue of Cell Lines and Hybridamas 
(7th edition, 1992) . For example, fish cells that can be used 
include cells derived from rainbow trout, salmon, and the 
like. Suitable promoters and vectors are described for 

20 instance, in Friedenreich et al., (1990) Nuc. Acids Res. 
18:3299-3305. 

Appropriate vectors for expressing preprohormone 
polypeptides in insect cells are usually derived from the SF9 
baculovirus. Suitable insect cell lines include mosquito 

25 larvae, silkworm, armyworm, moth and Drosqphila cell lines 

such as a Schneider cell line (See Schneider J. Embryol. Exp. 
Morphol. 27:353-365 (1987). 

As indicated above, the vector, e.g. , a plasmid, 
which is used to transform the host cell, preferably contains 

30 DNA sequences to initiate transcription and sequences to 

control the translation of the preprohormone gene sequence. 
These sequences are referred to as expression control 
sequences . 

As with yeast, when higher animal host cells are 
35 employed, polyaden 1 ya t i on or transcription terminator 

sequences from known mammalian genes need to be incorporated 
into the vector. An example of a terminator sequence is the 
polyadenlyation sequence from the bovine growth hormone gene. 
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Sequences for accurate splicing of the transcript may also be 
included. An example of a splicing sequence is the VP1 intron 
from SV40 (Sprague, J. et al., 1983, J. Virol. 45: 773-781). 

Additionally, gene sequences to control replication 
5 in the host cell may be incorporated into the vector such as 
those found in bovine papilloma virus type-vectors. 
Saveria-Campo, M. , 1985, "Bovine Papilloma virus DNA a 
Bukaryotic Cloning Vector" in DNA Cloning Vol. II a Practical 
Approach Ed. D.M. Glover, IRL Press, Arlington, Virginia pp. 
10 213-238. 

The host cells are competent or rendered competent 
for transformation by various means. There are several 
well-known methods of introducing DNA into animal cells. 
These include: calcium phosphate precipitation, fusion of the 

15 recipient cells with bacterial protoplasts containing the DNA, 
treatment of the recipient cells with liposomes containing the 
DNA, DBAB dextran, eiectroporation and micro- injection of the 
DNA directly into the cells. 

The transformed cells are cultured by means well 

20 known in the art. Biochemical Methods in Cell Culture and 
Virology, Kuchler, R.J., Dowden, Hutchinson and Ross, Inc., 
(1977) . The expressed preprohormone polypeptides are isolated 
from cells grown as suspensions or as monolayers. The latter 
are recovered by well known mechanical, chemical or enzymatic 

25 means. 

D. Pttrt^caUpn q£ fHig 5 , yrp 7 . Tyr 8 ] -gnRff prgprohgrmpne or 
f 9ft7^1 -^*H prrorohormone polypeptides 

The polypeptides produced by recombinant DNA 

30 technology may be purified by standard techniques well known 
to those of skill in the art. Recombinantly produced 
preprohormone polypeptides can be directly express ad or 
expressed as a fusion protein. The protein is then purified 
by a combination of cell lysis (e.g., sonication) and affinity 

35 chromatography. For fusion products, subsequent digestion of 
the fusion protein with an appropriate proteolytic enzyme 
release the desired polypeptide. 
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The polypeptides of this invention may be purified 
to substantial purity by standard techniques well known in the 
art, including selective precipitation with such substances as 
ammonium sulfate, column chromatography, immunopurif ication 
5 methods, anH others. See, for instance, R. Scopes, Protein 
Purification; Principles and Practice, Springer-Verlag: New 
York (1982), incorporated herein by reference. 

E. Production of rHis 5 -Trp 7 .T yr 8 l -GnRH preprohormone or 
10 TSer 8 1 -GnRH preprohormone polypeptides by protein 

chemistry techniques 

The polypeptides of the invention can be 
synthetically prepared in a wide variety of ways. For 
instance polypeptides of relatively short size, can be 
15 synthesized in solution or on a solid support in accordance 

with conventional techniques. Various automatic synthesizers 
are commercially available and can be used in accordance with 
known protocols. See, for example, Stewart and Young, Solid 
Phase Peptide Synthesis, 2d. ed., Pierce Chemical Co. (1984). 

20 

F. Modification of nu cleic acid and polypeptide sequences 

The nucleotide sequences used to trans feet the host 
cells used for production of recombinant preprohormone 
polypeptides can be modified according to standard techniques 

25 to yield preprohormone polypeptides with a variety of desired 
properties. The polypeptides of the present invention can be 
readily designed and manufactured utilizing various 
recombinant DNA techniques well known to those skilled in the 
art. For example, the polypeptides can vary from the 

30 naturally- occurring sequence at the primary structure level by 
*wri nn acid insertions, substitutions, deletions, and the like. 
These modifications can be used in a number of combinations to 
produce the final modified protein chain . 

The amino acid sequence variants can be prepared 

35 with various objectives in mind, including facilitating 

purification and preparation of the recombinant polypeptides. 
The modified polypeptides are also useful in, for example, 
modifying plasma half-life, improving therapeutic efficacy. 
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and lessening the severity or occurrence of side effects 
during therapeutic use. The amino acid sequence variants are 
usually predetermined variants not found in nature but exhibit 
the same immunogenic activity as naturally occurring 
5 polypeptides. For instance, polypeptide fragments canqprising 
only a portion (usually at least about 60-80%, typically 90- 
95%) of the primary structure may be produced. In general, 
modifications of the sequences encoding the polypeptides may 
be readily accomplished by a variety of well-known techniques, 
10 such as site-directed mutageresis (See, Gillman and Smith, 

Gene 8:81-97 (1979) and Roberts, S. et al. , Nature 328:731-734 
(1987)) . 

6. Production Qf Transgenic Non-human ftnimals 
15 The invention also encompasses methods and 

polynucleotide constructs which are employed for generating 
transgenic non-human animal z; which express the preprohormone 
polypeptides of the invention. The constructs are used to 
produce transgenic non-human mammals, birds (e.g., chickens) , 
20 fish and the like. Incorporation of nucleotide sequences of 

the invention allow modification of the reproductive and other 
characteristics of the transgenic animals. For example, fish 
that do not readily spawn in captivity can be induced to 
reproduce. 

25 Appropriate constructs and methods for production of 

transgenic animals cure known. Typically, the coding sequence 
of interest is operably linked to expression regulatory 
sequences. In such transgenes , the expression regulatory 
sequence is at least the minimal sequences required for 

30 efficient cell -type specific expression, which generally are 
at least a promoter and sequences upstream of the promoter, 
which provide for efficient expression in the target cells. 
Usually the sequences upstream of the promoter are used . 
contiguously, although various deletions and rearrangements 

35 can be employed. Some desired regulatory elements (e.g., 

enhancers, silencers) may be relatively position- insensitive, 
so that the regulatory element will function correctly even if 
positioned differently in a transgene than in the 
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corresponding germline gene. For example, an enhancer may be 
located at a different distance from a promoter, in a 
different orientation, and/or in a different linear order. 
For example, an enhancer that is located 3 1 to a promoter in 
5 germline configuration might be located 5' to the promoter in 
a transgene. 

Typically, expression regulation sequences are 
chosen to produce tissue -specific or cell type- specific 
expression of the recombinant DMA, Once a tissue or cell type 

10 is chosen for expression, expression regulation sequences are 
chosen. Generally, such expression regulation sequences are 
derived from genes that are expressed primarily in the tissue 
or cell type chosen. Preferably, the genes from which these 
expression regulation sequences are obtained are expressed 

15 substantially only in the tissue or cell type chosen, although 
secondary expression in other tissue and/ or cell types is 
acceptable if expression of the recombinant DNA in the 
transgene in such tissue or cell type is not detrimental to 
the transgenic animal. 

20 Particularly preferred expression regulation 

sequences are those endogenous to the species of a n i m a l to be 
manipulated. However, expression regulation sequences from 
other species such as those from human genes may also be used. 
In some instances, the expression regulation sequences and the 

25 recombinant DNA sequences (either genomic or cDNA) are from 
the same species. Alternatively, the expression regulation 
sequences recombinant DNA sequences (either cDNA or 
genomic) are obtained from different species. In such cases, 
the expression regulation and recombinant DNA sequence are 

30 heterologous to each other. 

In certain embodiments, it is desirable to use gene 
targeting, mediated by homologous rec om bi n ation between a 
targeting polynucleotide construct and a homologous 
chromosomal sequence, to replace an endogenous gene with the 

35 gene encoding a mutant of the preprohormone gene. Methods and 
materials for preparing such constructs are known by those of 
skill in the art are described in various references. 
See, e.g., Thomas et al.. Cell 51:503 (1987) and Capecchi, 
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Science 244:1288 (1989). Homologous targeting constructs have 
at least one region having a sequence that substantially 
corresponds to, or is substantially complementary to, a 
predetermined endogenous target gene sequence (e.g., an exon 
5 sequence, an enhancer, a promoter, an intronic sequence, or a 
flanking sequence of the target gene) . Such a homology region 
serves as a template for homologous pairing and recombination 
with substantially identical endogenous gene sequence (s) . In 
the targeting of transgenes, such homology regions typically 

10 flank the replacement region, which is a region of the 

targeting trans gene that is to undergo replacement with the 
targeted endogenous gene sequence. Thus, a segment of the 
targeting transgene flanked by homology regions can replace a 
segment of the endogenous gene sequence by double crossover 

15 homologous recombination. 

The constructs described above are introduced into 
pluripotent or totipotent cells using standard techniques. 
Briefly, this technology involves the insertion of the desired 
transgene construct into an appropriate cell line (e.g., 

20 mammalian embryonic stem (ES) cells or fertilized oocytes) 
that is capable of differentiating into germ cell tissue. 
Methods of introducing transgenes into embryonal target cells 
include microinjection of the transgene into the pronuclei of 
fertilized oocytes or nuclei of ES cells of the non-human 

25 animal. Such methods for murine species are well known to 

those skilled in the art. Alternatively, the transgene may be 
introduced into an animal by infection of zygotes with a 
retrovirus containing the transgene (Jaenisch, R. (1976) Proc. 
Natl. Acad. Sci. USA 73:1260-1264). 

30 The production of transgenic non-human mammals 

(e.g., mice, cows, pigs, and the like) are described in 
International Application No. WO/08216 and Krimpenfort, et 
al. , Biotechnology 9:844-847 (1991). Germ 1 s transgenes is 
of both chickens and Japanese quail has been .ascribed. For a 

35 general description of the production or transgenic birds. 
See, Shuman (1991) Bxperlentia 47:897-905. 

In certain preferred embodiments, transgenic fish 
are generated. A variety of exarples of the production of 
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transgenic fish have been reported (See, e.g., Guyamard et al. 
(1989) Biochimie 71:857-863 and Rokkones et al. (1989) J". 
Ccanp. Physiol. B. 158:751-785. Vectors can be constructed 
using mammalian promoters, e.g., from the murine 
5 metallothionen gene, or viral promoters, e.g., Rous sarcoma 
virus and simian virus 40 (see, e.g., Stuart et al. (1988) 
Development 103:403-412 and Yoon et al. (1990) Aquaculture 
85:21-33). Alternatively, vectors comprising entirely piscine 
regulatory sequences can be used (Liu et al. (1990) 

10 Bio/Technology 8:1268-1272) . 

The appropriate construct comprising an expression 
cassette containing the sequences encoding the preprohormone 
is introduced by microinjection into recently fertilized eggs 
isolated from spawning fish of the appropriate species (e.g., 

15 salmon, trout, and the like) . The injected embryos are reared 
in the appropriate medium and allowed to develop into mature 
fish using standard aqua cultural techniques. 

Tissue samples from the transgenic animals can be 
analyzed using standard techniques for detecting the presence 

20 or absence of a target sequence. For instance, Fluorescent In 
Situ Hybridization (FISH) can be used to detect the transgene. 
Several guides to FISH techniques are available, e.g.. Gall et 
al. Me tii. Enzywol., 21:470-480 (1981) and Angerer et al. in 
Genetic Engineering: Principles and Methods Setlow and 

25 Hollaender, Eds. Vol 7, pgs 43-65 (Plenum Press, New York 
1985) . 

The sequences can also be detected by PGR using 
primers and probes specific for the transgene. Standard PCR 
methods useful in the present invention are described in PCR 
30 Protocols: A Guide to Methods and Applications (Innis et al., 
eds. , Academic Press, San Diego). 

H. Pharmaceutical Compositions 

The preprohormone polypeptides (e.g., GAP peptides) 
35 of the present invention can also be administered to control 

reproductive function. Thus, the polypeptides can be prepared 
in pharmaceutical compositions and administered using methods 
well known in the art. For instance, the pharmaceutical 
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compositions can be administered orally with feed for the 
regulation of reproductive function in farm animals such as 
chickens. The compositions can also be suitable for 
parenteral administration. The pharmaceutical compositions 
5 can be administered in a variety of unit dosage forms 

depending upon the method of administration. For example, 
unit dosage forms suitable for oral administration include 
powder, tablets, pills, and capsules. 

Suitable pharmaceutical formulations for use in the 

10 present invention are found in Remington' 3 Pharmaceu ti cal 

Sciences, Mack Publishing Company, Philadelphia, PA, 17th ed. 
(1985) • A variety of pharmaceutical compositions comprising 
preprohormone polypeptides of the present invention and 
pharmaceutical^ effective carriers can be prepared. 

15 Pharmaceutical compositions of the invention include 

nucleic acid sequences encoding the preprohormone inserted 
into a suitable gene therapy vector. A variety of different 
gene therapy vectors may be used. Current strategies and 
vectors for gene therapy are reviewed in Miller A.D, (1992) 

20 mature, 357:455-460 and Mulligan, R.C. (1993) Science 260:926- 
932. The gene therapy vector may be delivered to an 
individual patient, typically by systemic administration 
(e.g., intravenous, intraperitoneal, intramuscular, subdermal, 
or intracranial infusion) . For systemic administration, 

25 injection is preferred, including intramuscular, intravenous, 
intraperitoneal, f*™* subcutaneous injection. Suitable 
formulations for injection are found in Remington ' s 
Pharmaceu ti cal Sciences, supra. The pharmaceutical 
compositions are suitable in a variety of drug delivery 

30 systems. For a brief review of present methods of drug 

delivery. See, Langer, Science 249:1527-1533 (1990) which is 
incorporated herein by reference. 
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I. Detection of Nucleic acids encoding fHis 5 . Trp 7 . Tyr 8 T -GnRH 
preprohormone or TSer e 1 -GnRH preprohormone polypeptides and 
detection of rHis 5 .Trp 7 .Tvr 8 1 -GnRH preprohormone or [Ser 8 ] - 
GnRH preproh ormone polypeptides bv immunoassays 
5 The present invention provides methods for detecting 

DNA or RNA encoding [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone or 
[Ser 8 ] -GnRH preprohormone and for measuring these 
preprohormones by immunoassay techniques. These methods are 
useful for two general purposes. First, assays for detection 

10 of nucleic acids encoding these preprohormones are important 
for the isolation these nucleic acids from a variety of 
species. As described above, it is known that the mature 
[His 5 , Trp 7 , Tyr 8 ] -GnRH hormone is present in a wide range of 
species (See Sherwood, N.M. et al., supra). Now that the 

15 [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone and [Ser 8 ] -GnRH cDNA's 

have been cloned and isolated, nucleic acids encoding these 
preprohormones may be isolated from a variety of species 
according to the methods described in section (B) above and by 
use of the nucleic acid hybridization assays described below. 

20 The immunoassays described below may be useful for isolation 
of nucleic acids encoding the [His 5 , Trp 7 , Tyr 8 ] -GnRH or [Ser 8 ] - 
GnRH preprohormones by expression cloning methods . See 
section (B) above and Sambrook, et al. 

The assays described below are also useful as In 

25 vitro diagnostic assays to determine the reproductive status 
of farm animals or of fish in aquacul ture. For example, 
testing of fish for [His 5 , Trp 7 , Tyr 8 ] -GnRH or [Ser 8 ] -GnRH 
preprohormone production at various times may play a role in 
the regulation of fish reproduction in aquacul ture. In 

30 addition, the discovery of the [His 5 , Trp 7 , Tyr 8 ] -GnRH mature 
hormone in placental mammals (See Sherwood, N.M. , supra) and 
the demonstration, herein, of the [His 5 , Trp 7 , Tyr 8 ] -GnRH 
preprohormone in the treeshrew suggests that the 
[His 5 , Trp 7 , Tyr 6 ] -GnRH preprohormone may also be produced in 

35 humans. Therefore, the assays described below may also be 
useful in tests for reproductive function in humans. 
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1. Nucleic Acid Hybridization Assays 

A variety of methods for specific DNA and RNA 
measurement using nucleic acid hybridization techniques are 
known to those of skill in the art. See Sambrook, et al. For 
5 example, one method for evaluating the presence or absence of 
[His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone DNA or [Ser 8 ] -GnRH 
preprohormone DNA in a sample involves a Southern transfer. 
Briefly, the digested genomic DNA is run on agarose slab gels 
in buffer and transferred to membranes. Hybridization is 

10 carried out using the nucleic acid probes discussed above. As 
described above, nucleic acid probes are designed based on the 
known nucleic acid sequences encoding the [His 5 ,Trp 7 ,Tyr 8 ] - 
GnRH preprohormone or the [Ser 8 ] -GnRH preprohormone. 
Visualization of the hybridized portions allows the 

15 qualitative determination of the presence or absence of DNA 
encoding [His 5 , ??rp 7 ,Tyr 8 ] -GnRH preprohormone or the [Ser 8 ] - 
GnRH preprohormone. 

Similarly, a Northern transfer may be used for the 
detection of mRNA encoding [His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone 

20 or the [Ser 8 ] -GnRH preprohormone. In brief, the mRNA is 

isolated from a given cell sample using an acid guanidinium- 
phenol - chloroform extraction method. The mRNA is then 
electrophoresed to separate the mRNA species and the mRNA is 
transferred from the gel to a nitrocellulose membrane. As 

25 with the Southern blots, labeled probes are used to identify 
the presence or absence of the [His 5 , Trp 7 ,Tyr 8 ] -GnRH or the 
[Ser 8 ] -GnRH preprohormone transcript. 

A variety of nucleic acid hybridization formats are 
known to those skilled in the art. For example, common 

30 formats include sandwich assays and competition or 

displacement assays* Hybridization techniques are generally 
described in "Nucleic Acid Hybridization, A Practical 
Approach,* Ed. Hames, B.D. and Higgins, S.J., IRL Press, 1985; 
Gall and Pardue (1969), Proc. Natl. Acad. Sci., U.S.A., 

35 63:378-383; and John, Burnsteil and Jones (1969) Nature, 
223:582-587. 

For example, sandwich assays are commercially useful 
hybridization assays for detecting or isolating nucleic acid 
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sequences. Such assays utilize a "capture" nucleic acid 
covalently immobilized to a solid support and labelled 
"signal" nucleic acid in solution. The clinical sample will 
provide the target nucleic acid. The "capture" nucleic acid 
5 ?tv3 "signal" nucleic acid probe hybridize with the target 

nucleic acid to form a "sandwich" hybridization complex. To 
be effective, the signal nucleic acid cannot hybridize with 
the capture nucleic acid. 

Typically labelled signal nucleic acids are used to 

10 detect hybridization. Complementary nucleic acids or signal 
nucleic acids may be labelled by amy one of several methods 
typically used to detect the presence of hybridized 
polynucleotides. The most common method of detection is the 
use of autoradiography with 3 H, 125 I, 35 S, 14 C, or 32 P- labelled 

15 probes or the like. Other labels include ligands which bind 
to labelled antibodies, f luorophores , chemiluminescent agents, 
enzymes, and antibodies which can serve as specific binding 
pair members for a labelled ligand. 

Detection of a hybridization complex may require the 

20 binding of a signal generating complex to a duplex of target 
AT i fl probe polynucleotides or nucleic acids. Typically, such 
binding occurs through ligand and anti- ligand interactions as 
between a ligand- conjugated probe and an anti- ligand 
conjugated with a signal. 

25 The label may also allow indirect detection of the 

hybridization complex. Por example, where the label is a 
hapten or antigen, the sample can be detected by using 
antibodies. In these systems, a signal is generated by 
attaching fluorescent or enzyme molecules to the antibodies or 

30 in some cases, by attachment to a radioactive label. 

(Tijssen, P., "Practice and Theory of Enzyme Immunoassays," 
Laboratory Techniques in Biochemistry and Molecular Biology, 
Burdon, R.H., van Knippenberg, P.H., Eds., Elsevier (1985), 
pp. 9-20.) 

35 The sensitivity of the hybridization assays may be 

enhanced through, use of a nucleic acid amplification system 
which multiplies the target nucleic acid being detected. 
Examples of such systems include the polymerase c h a in reaction 
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(PCR) system and the ligase chain reaction (LCR) system. 
Other methods recently described in the art are the nucleic 
acid sequence based amplification (NASBA™, Cangene, 
Mississauga, Ontario) and Q Beta Replicase systems. 
5 An alternative means for determining the level of 

expression of the [His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone gene or 
the [Ser 8 ] -GnRH preprohormone is in situ hybridization. In 
situ hybridization assays are well known and are generally 
described in Angerer, et ai., Methods Enzymol . , 152:649-660 

10 (1987) . In an in situ hybridization assay, cells, 

pref erentially bovine lymphocytes are fixed to a solid 
support, typically a glass slide. If DNA is to be probed, the 
cells are denatured with heat or alkali. The cells are then 
ccitacted with a hybridization solution at a moderate 

15 temperature to permit annealing of labeled probes oecific to 
[His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone or the [Ser 8 ] -GnRH 
preprohormone. The probes are preferably labelled with 
radioisotopes or fluorescent reporters. 



20 2^ Productio n of Antibodies and Development of 

In addition to detecting expression of the 
[His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone or the [Ser 8 ] -GnRH 
preprohormone by nucleic acid hybridization, one can also use 

25 immunoassays to detect the preprohormones . Immunoassays can 
be used to qualitatively or quantitatively analyze for the 
preprohormone or to specifically detect the GAP peptide. A 
general overview of the applicable technology can be found in 
Harlow «nri Lane, Antibodies; A Laboratory Manual, Cold Spring 

30 Harbor Pubs., N.Y. (1988), incorporated herein by reference, 
a. Antibody Production 

A number of immunogens may be used to produce 
antibodies specifically reactive with the [His 5 ,Trp 7 ,Tyr 8 ] - 
GnRH preprohormone or the [Ser 8 ] -GnRH preprohormone. 
35 Recombinant preprohormone is the preferred immunpgen for the 
production of monoclonal or polyclonal antibodies. Naturally 
occurring preprohormone may also be used either in pure or 
impure form. Synthetic peptides made using the 
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[His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone or the [Ser 8 ] -GnRH 
preprohormone sequences described herein may also used as an 
immunogen for the production of antibodies to the 
preprohormone . Preferentially , recombinant [His 5 , Trp 7 , Tyr 8 ] - 
5 GnRH preprohormone or [Ser 8 ] -GnRH preprohormone or fragments 
thereof are expressed in bacterial cells as described above, 
and purified as generally described above. The product is 
then injected into an animal capable of producing antibodies. 
Either monoclonal or polyclonal antibodies may be generated, 

10 for subsequent use in immunoassays to measure the 
preprohormone . 

Methods of production of polyclonal antibodies are 
known to those of skill in the art. In brief, an immunogen, 
preferably a purified protein, is mixed with an adjuvant and 

15 animals are immunized. The animal's immune response to the 

immunogen preparation is monitored by taking test bleeds and 
determining the titer of reactivity to the [His 5 , Trp 7 , Tyr 8 ] - 
GnRH preprohormone or the [Ser 8 ] -GnRH preprohormone. When 
appropriately high titers of antibody to the immunogen are 

20 obtained, blood is collected from the animal and antisera is 
prepared. Further fractionation of the antisera to enrich for 
antibodies reactive to the preprohormone can be done if 
desired. (See Harlow and Lane, supra) . 

Monoclonal antibodies may be obtained by various 

25 techniques familiar to those skilled in the art. Briefly, 

spleen cells from an animal immunized with a desired antigen 
are immortalized, commonly by fusion with a myeloma cell (See, 
Kohler and Milstein, Bur. «7. Immunol. 6:511-519 (1976), 
incorporated herein by reference) . Alternative methods of 

30 immortalization include transformation with Epstein Barr 

Virus, oncogenes, or retroviruses, or other methods well known 
in the art. Colonies arising from single immortalized cells 
are screened for production of antibodies of the desired 
specificity and affinity for the antigen, and yield of the 

35 monoclonal antibodies produced by such cells may be enhanced 

by various techniques, including injection into the peritoneal 
cavity of a vertebrate host. 
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Methods of production of synthetic peptides are 
known to those of skill i the art. Briefly, the predicted 
immunogenic regions of 3 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohorxnone or 
[Ser 8 ] -GnRH preprohormone sequences described herein are 
5 identified. For production of antibodies specific to the 
[His 5 , Trp 7 ,Tyr 8 ] -GnRH or [Ser 8 ] -GnRH preprohormone GAP 
peptides, predicted immi logenic regions of the preprohormone 
sequence in the GAP region of the molecule are used (see Seq. 
ID No. 2 , Seq ID No. 18 and Seq. ID No 20. Peptides 

10 preferably at least 10 amino acids in length are synthesized 

corresponding to these regions and the peptides are conjugated 
to larger protein molecules for subsequent immunization. 
Production of monoclonal or polyclonal antibodies is then 
carried out as described above. 

15 b. Immunoassays 

Antibodies reactive with a particular protein can be 
measured by a variety of immunoassay methods. For a review of 
immunological anr» immunoassay procedures in general, See Basic 
ariri Clinical Immunology 7th Edition (D. Stites and A. Terr 

20 ed.) 1991. Moreover, the immunoassays of the present 

invention can be performed in any of several configurations, 
which are reviewed extensively in Enzyme Immunoassay, E.T. 
Maggio, ed., CRC Press, Boca Raton, Florida (1980); "Practice 
and Theory of Enzyme Immunoassays," P. Tijssen, Laboratory 

25 Techniques in Biochemistry and Molecular Biology, Elsevier 
Science Publishers B.V. Amsterdam (1985) ; and, Harlow and 
Lane, Antibodies, A Laboratory Manual, supra, each of which is 
incorporated herein by reference. 

Immunoas says for measurement of [His 5 ,Trp 7 ,Tyr 8 ] - 

30 GnRH preprohorxnone or [Ser 8 ] -GnRH preprohormone polypeptides 
can be performed by a variety to methods known to those 
skilled in the art. In brief, immunoassays to measure the 
preprohormones or tt GAP peptides can be either competitive 
or noncompetitive binding assays. In competitive binding 

35 assays, the sample analyte competes with a labeled analyte for 
specific binding sites on a capture agent bound to a solid 
surface. Preferably the capture agent is an antibody 
specifically reactive with the [His 5 , Trp 7 ,Tyr 8 ] -GnRH 
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preprohonnone or [Ser 8 ] -GnRH preprohormone produced as 
described above. The concentration of labeled analyte bound 
to the capture agent is inversely proportional to the amount 
of free analyte present in the sample. 
5 In a competitive binding immunoassay, 

[His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone or [Ser 8 ] -GnRH 
preprohormone present in the sample competes with labelled 
preprohormone for binding to a specific binding agent, for 
example, an antibody specifically reactive with the 

10 preprohormone. The binding agent may be bound to a solid 

surface to effect separation of bound labelled preprohormone 
from the unbound labelled preprohormone. Alternately the 
competitive binding assay may be conducted in liquid phase and 
any of a variety of techniques known in the art may be used to 

15 separate the bound labelled preprohormone from the unbound 

labelled preprohormone. Following separation, the a mo unt of 
bound labeled preprohormone is determined. The amount of 
preprohormone present in the sample is inversely proportional 
to the amount of labelled preprohormone binding. 

20 Alternatively, a homogenous immunoassay may be 

performed in which a separation step is not needed. In these 
immunoassays, the label on the preprohormone is altered by the 
binding of the preprohormone to its specific binding agent. 
This alteration in the labelled protein results in a decrease 

25 or increase in the signal emitted by label, so that 

measurement of the label at the end of the immunoassay allows 
for detection or quantitation of the preprohormone. 

[His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone or [Ser 8 ] -GnRH 
preprohormone may also be determined by a variety of 

30 noncompetitive immunoassay methods . For example, a two- site, 
solid phase sandwich immunoassay is used. In this type of 
assay, a binding agent for the preprohormone, for example am 
antibody, is attached to a solid phase. A second 
preprohormone binding agent, which may also be an antibody, 

35 *tiH which binds the preprohormone at a different site, is 

labelled. After binding at both sites on the preprohormone 
hf*q occurred, the unbound labelled binding agent is removed 
anrf the amount of labelled binding agent bound to the solid 
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phase is measured. The amount of labelled binding agent bound 
is directly proportional to the amount of preprohormone in the 
sample* 

Western blot analysis can also be done to determine 
the presence of [His 5 ,Trp 7 ,Tyr 8 ] -GnRH or [Ser 8 ] -GnRH 
preprohormone in a sample. Electrophoresis is carried out, 
for example, on a tissue sample suspected of containing the 
preprohormone. Following electrophoresis to separate the 
proteins, and transfer of the proteins to a suitable solid 
support such as a nitrocellulose filter, the solid support is 
then incubated with an antibody reactive with the 
preprohormone. This antibody may be labeled, or alternatively 
may be it may be detected by subsequent incubation with a 
second labelled antibody that binds the ant i- preprohormone 
antibody. 

The immunoassay formats described above employ 
labelled assay components. The label can be in a variety of 
forms. The label may be coupled directly or indirectly to the 
desired component of the assay according to methods well known 
in the art. A wide variety of labels may be used. The 
component may be labeled by any one of several methods. 
Traditionally a radioactive label incorporating 3 H, 125 I, 35 S, 
14 C, or 32 P was used. Non- radioactive labels include ligands 
which bind to labelled antibodies, f luorophores, 
chemilumines cent agents, enzymes, and antibodies which can 
serve as specific binding pair members for a labelled ligand. 
The choice of label depends on sensitivity required, ease of 
conjugation with the compound, stability requirements, *nri 
available instrumentation. For a review of various labelling 
or signal producing systems which may be used, see U.S. Patent 
No. 4,391,904, which is incorporated herein by reference. 

This invention also embraces diagnostic kits for 
detecting the presence of [His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone 
or [Ser 8 ] -GnRH preprohormone in tissue or blood samples which 
comprise a container containing antibodies selectively 
immunoreactive to the preprohormone and instructional material 
for performing the test. This invention further embraces 
diagnostic kits for detecting [His 5 ,Trp 7 ,Tyr 8 ] -GnRH or [Ser 8 ]- 
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GnRH preprohormone DNA or RNA in tissue or blood samples which 
comprise nucleic probes as described herein and instructional 
material . 



\ 
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EXAMPLES 

Example 1; Isolation of a cDNA encoding H. burton! 
f His 5 . Trp 7 . Tvr 8 1 -GnRH preprohormone 



5 Development of an oligonucleotide probe specific for 

H. burton! fHis 5 , Tro 7 . Tvx 8 1 -GnRH preprohormone cDNA 
To obtain nucleotide sequences encoding a 
[His 5 , Trp 7 ,Tyr 8 ] -GnRH- like peptide from H. burtoni, we 
employed a novel polymerase chain reaction (PCR) strategy 

10 using brain regions known to contain GnRH neuronal populations 
(see Pig. 4 for a schematic view of H. burton! brain) . 
Animals were sacrificed and whole brains removed as described 
in White, S.A. & Fernald, R.D. J. Neurosd. (1993) 13:434-441. 
The dorsal portion of the telencephalon, the optic tectum and 

15 cerebellum from each brain was removed. Total ENA was 

isolated from the remaining ventral portions by guanidine 
th i ocyana t e - acid phenol extraction and converted into cDNA 
using reverse transcriptase (Superscript, BRL; Gaithesburg, 
MD) . cDNA synthesis was primed with 0.5 /ig of a bipartite 

20 oligonucleotide consisting of a homopolymer of 9 d(T) residues 
at .the 3' end. The 5 V domain comprised a sequence of 14 
nucleotides which included a restriction endonuclease 
recognition site. The sequence of the oligonucleotide is: 5' 
GCAGAAGCTTCAGCT (9) 3' (Seq. ID No. 15). 

25 The resulting cDNA was used as substrate for nested 

PCR. In both rounds of amplification, the downstream primer 
was equivalent to the 5* domain of the bipartite 
oligonucleotide, described above. In the first PCR reaction, 
the upstream primer was a pool of 24 14-mers representing all 

30 possible coding sequences for the first five amino acids of 
[His 5 ,Trp 7 ,Tyr 8 ] -GnRH (see figure Kb)). 12.5 pmol of the 
upstream primer was used. After 40 cycles of 94°C/30 s; 
55°C/30s; 72°C/15s, 0.05 fil of the product of this reaction 
was aliquoted to a second reaction using 12.5 pmol of a new 

35 upstream primer together with the same downstream primer under 
the same cycling conditions. The new primer pool of 24 16- 
mers consisted of all possible coding combinations for 
residues 5-9 of the decapeptide (see figure 1(b)). Reactions 
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were performed using Taq polymerase (synthesized courtesy of 
Dr. R. Moses) on a Perkin- Elmer 9600 thennocycler. 
Oligonucleotides were synthesized as needed using an Applied 
Biosystems 391 instrument. 
5 PCR products were separated using a 1.5% GTG agarose 

gel (FMC, Rockville, ME, USA) and products of greater than 300 
bases were electrocuted and subcloned into M13 for sequencing 
as described in Bond, C.T., Francis, R.C., Pernald, R.D. & 
Adelman, J. P. (1991) J. Mol. Endocrinol. 5:931-937. 

10 Nucleotide sequences were analyzed for the presence of the 

second upstream primer (ending in the codon for amino acid 9; 
see Pig. 2), followed by a codon for the final conserved amino 
acid of the decapeptide, 10 Gly, and sequences for the 
canonical GnRH amidation and peptide processing site, 

15 ^ly^Lys^Arg. (See Douglass, J., Civelli, 0. & Herbert, E. 
(1984) Ann. Rev. Biochem. 53:665-715.) 

A radiolabeled nucleotide probe 29 bases in length 
was derived from a putative [His 5 , Trp 7 ,Tyr 8 ] -GnRH PCR product. 
The sequence of this oligonucleotide is 

20 GGGAATGCAGCTACCTGAGACCCCAGAGG (Seq. ID No. 16). 

BI Isolation of a full -length cDNA encoding the H. 
hurtioni rHis 5 .Trp 7 ,Tvr 8 l-GnRH oreprohormone 
To obtain full-length coding sequences, poly-adenylated 
25 RNA was isolated from H. imrtoni ventral brain regions (see 

above) and used to construct a cDNA library in XgtlO. 250,000 
primary recombinant were screened ( Genes creen; NEN-Dupont, 
Boston, MA, USA) using the radiolabeled oligonucleotide of Seq 
ID No. 16, described above. Positively hybridizing phage were 
30 purified by successive res creen s at reduced density. The cDNA 
inserts were subcloned into M13 phage and the nucleotide 
sequences determined. A cDNA containing the full-length 
coding sequence was subcloned into a transcription vector 
(Pselect, USB) for generation of sense and antisense 
35 riboprobes . 
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Example 2: Sequence Analysis of the H. burton! cDNA encoding 

the fgj-s 5 , Trp 7 , TVT B 1 -QpRH pyeprohormpnQ 

Six positively hybridizing clones were identified 
from the H. burton! ventral brain cDNA library as described 
5 above and the nucleotide sequences of the inserts were 

determined. The full-length cDNA sequence is shown _n Seq. ID 
No. 1, and the predicted amino acid sequence for the H. 
burton 1 chicken II GnRH preprohormone is shown in figure 2 
(Seq. ID No. 2) . The longest open reading frame containing 

10 the [His 5 , Trp 7 , Tyr 8 ] -GnRH coding sequence begins with a 
methionine initiator codon and predicts a 90 amino acid 
protein (Fig. 2) . The first 23 residues are largely 
hydrophobic and are likely a signal peptide as is found in 
many polyprotein neuroendocrine precursors (see Douglass, J., 

15 Civelli, 0. & Herbert, E. (1984) Ann. Rev. Biochem. 

53:665*715.) The signal sequence is in direct linkage with 
[His 5 , Trp 7 , Tyr 8 ] -GnRH, which is followed by Gly 11 Lys 12 Arg 13 . 
These residues follow the decapeptide sequence in all of the 
other cloned GnRH preprohormones and serve as substrates for 

20 post-translational processing {See Douglass, J. f et al . , 

supra) . The amino acid sequence of the [His 5 , Trp 7 , Tyr 8 ] -GnRH 
preprohormone (Seq. ID No. 3) is shown for comparison in 
Figure 2. The remainder of the precursor may code for two 
additional peptides also generated by proteolytic processing, 

25 neither of which show any homology to sequences in protein 

databases. One peptide, 28 residues in length, is followed by 
dibasic residues (arg-arg, see Fig. 2 legend) , and the 
remaining 19 amino acids could comprise a second processing 
product. There is only about 10% identity between the two H. 

30 burton 1 GnRH preprohormones when the GnRH motif is excluded 
(see Figure 2) . 

TCYampifi Northern blot anaiyff?, ? of H- burtoni brain tissue 
for the FHls 5 . Trp 7 . Tyr 8 ! -GnRH Precursor 
35 Northern blots were prepared as previously described 

(see Bond, C.T., Francis, R.C., Fernald, R.D. & Adelman, J. P. 
(1991) «7. Mol. Endocrinol. 5:931-937) and probed with 
antisense riboprobes synthesized with SP6 RNA polymerase, 
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incorporating 32 P- labelled UTP to a specific activity of 3 x 
10 8 dpm/jig. Hybridizations contained approximately 10 7 dpm 
per ml. High stringency conditions were obtained using 50% 
formamide at 65°C with washes in 0.1 x SSC at 70°C. 
5 Northern blot analysis of total RNA extracted from 

dorsal, ventral and whole H. burtoni brain revealed a single 
transcript of ca. 530 bases expressed in ventral and whole 
brain but absent from dorsal structures (Fig. 3).. 
Radiolabelled antisense riboprobes for both the • 

10 [His 5 ,Trp 7 ,Tyr 8 ] -GnRH and [Trp 7 ,Leu 8 ] -GnRH preprohormones were 
prepared and used to probe Northern blots of polyA+ RNA 
isolated from total brain at both high and low stringencies. 
The results (not shown) indicate that the probes for the two 
preprohormones do not cross -hybridize at either stringency 

15 with the alternative transcript. 

Bypmpl * a z in situ hybridization 

The site of [His 5 ,Trp 7 ,Tyr 8 ] -GnRH mRNA production in 
H. burton! . brain was assessed using in situ hybridization. 

20 Animals were sacrificed, brains removed and tissue fixed as 
previously described (See White, etal., J. Neurosci. (1993) 
13:434-44). 40 /im cryostat sections were mounted on poly 
L- lysine subbed slides for hybridization to the digoxygenin 
DTP (DIG, Boehringer Mannheim) -labeled [His 5 ,Trp 7 ,Tyr 8 ] -GnRH 

25 riboprobe . Hybridizations were carried out at high stringency 
(55°C in 50% formamide) according to the manufacturers 
instructions with the following modifications: 150 /il of 
hybridization solution was applied to slides containing 4 to 5 
brain sections. Following hybridization, 3 washes were done 

30 in 2 X SSC at 55°C for one hour each, followed by one wash in 
1 X SSC for 30 at roam temperature . To prepare the 
riboprobe, subcloned prepro [His 5 , Trp 7 , Tyr 8 ] - GnRH was 
linearized with Eco RI (GIBCO-BRL, Grand Island, New York, 
USA) , followed by filling in of recessed 3 • termini with the 

35 Klenow fragment of DNA polymerase I and Proteinase K 

treatment. The template DNA was purified using Magic DNA 
Clean-Up System (Promega Corporation, Madison, Wisconsin, USA) 
an^ 300 ng was used for transcription of a riboprobe with SP6 
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RNA polymerase (GIBCO-BRL) in the presence of DIG -conjugated 
UTP. Final concentrations in a lOOfil reaction volume were: 1 
X SP6 reaction buffer, 1 X NTP labeling mixture 
(Boehringer-Mannheim, Indianapolis, Indiana, USA), 4 • 0 ntM DTT 
80 units RNAsin (Promega) , 13.2 fig BSA, and 120 units of SP6 
polymerase. Solutions for both Northern analysis and in situ 
hybridizations were prepared using water treated with 0.075% 
diethylpyrocarbonate . 

[His 5 ,Trp 7 ,Tyr 8 ] -GnRH mRNA egression, localized 
using in situ hybridization, was found only in a cluster on 
neurons found in the mesencephalon (Fig. 4 (TOP) ) previously 
shown to contain GnRH using immunocytochemistry. Davis, M.R. 
& Fernald, R.D., J. Neurobiol. (1990) 21:1180-1188. 
Interestingly, [His 5 , Trp 7 ,Tyr 8 ] -GnRH is not expressed in the 
hypothalamic nucleus (Fig. 4 (LEFT COLUMN)) which projects to 
the pituitary hence probably does not directly influence 
gonadotropin activation. The probes for the two species of 
GnRH found in H. Jbirrtoni do not cross -react either when 
applied in situ (see Fig. 4) or in Northern blots (see above) 
Thus distinct forms of GnRH encoding genes are expressed in 
distinct brain regions. 

In H. burtani, as in other vertebrates, three 
neuronal populations have been shown to contain GnRH using 
immunocytochemistry: the te rm i n a l nerve, the 

hypothalamic/preoptic area and the mesencephalon. Davis, M.R 
& Fernald, R.D., J". Neurobiol. (1990) 21:11B0-1188; White, 
S.A. & Fernald, R.D., J. Neurosci. (1993) 13:434-441; see Fig 
4, TOP). Here we have shown that [His 5 , Trp 7 ,Tyr 8 ] -GnRH mRNA 
is expressed only in the mesencephalic population (Fig. 4, 
(LEFT COLUMN) ) . In contrast, [Trp 7 , Leu 8 ] -GnRH mRNA is 
localized only in the terminal nerve nucleus of the 
telencephalon (Fig. 4 (RIGHT COLUMN) ) . Furthermore, the lack 
of in situ hybridization within the GnRH-immunoreactive cells 
of the hypothalamic/preoptic area (Fig. 4) reveals that the 
GnRH- encoding gene expressed there is sufficiently different 
to elude detection by either probe. Thus, despite its potent 
releasing factor activity (cf. Ngamvongchon, S. Rivier, J.B. 
Sherwood, N.M. (1992), Regul. Pept. 42:63-73)), 



WO 95/12309 



48 



PCT/US94/12763 



[His 5 ,Trp 7 # Tyr 8 ] -GnRH is unl ikely to influence pituitary 
function directly in H. bxirtoni. Moreover, this suggests that 
yet a third gene encoding for GnRH may be responsible for 
regulating reproduction directly. 
5 The structure of the [His 5 , Trp 7 ,Tyr 8 ] -GnRH predicts 

two novel peptides in addition to it, in contrast to the 
single associated peptide predicted by [Trp 7 , Leu 8 ] -GnRH xriRNA 
(see Figure 2) . The lack of homology between the two known 
GnRH preprohormones in H. burton! suggests that these genes 

10 have either evolved independently or diverged from a common 
ancestor well before the appearance of teleost fish. 

The novel preprohormone structure and 
extra hypothalami r location of the H. Jburtoni [His 5 , Trp 7 , Tyr 8 ] - 
GnRH precursor may indicate a unique function for this GnRH 

15 form and/or its associated peptides. While hypothalamic GnRH 
clearly functions as a releasing hormone, neuramodulatory and 
neurotransmitter roles have been postulated for GnRHs in other 
brain regions (reviewed in Loumaye, E. f Thorner, J. & Catt, 
K.J. (1982) Science 218:1323-1325). The localization of the 

20 [His 5 , Trp 7 , Tyr 8 ] -GnRH mRNA to the mesencephalon supports 

previous immunological studies in which [His 5 , Trp 7 , Tyr 8 ] -GnRH- 
like peptides were found concentrated in caudal brain areas. 
Such mesencephalic GnRH- containing neurons may play a role in 
the central organization of reproductive behavior as suggested 

25 for poecilid fish where a similar neuronal population has been 
described. See Miller, K.E. & Kriebel, R.M. (1986) Gen. Camp. 
Endocrinol. 64:396-400. These cells contain GnRH and project 
to a spinal neurosecretory group which, in turn, has been 
postulated to play a role in gonadal duct contractility. 

30 As noted, only one GnRH peptide (previously called 

mammalian GnRH) , that which regulates the pituitary 
gonadotropes, has been demonstrated in more recently evolved 
mammals. This form can be detected in some bony fish, but is 
absent in H. Jburtoni as well as in reptiles and birds 

35 (reviewed in Loumaye, B. , Thorner, J. & Catt, K.J. (1982) 

Science 218:1323-1325, supra) . In contrast, [His 5 , Trp 7 , Tyr 8 ] - 
GnRH has widespread expression among vertebrates with the 
notable exception of more derived mammals. Since non- 
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mammalian animals express more than one GnRH peptide, it is 
possible that many mammals may express more than one form of 
GnRH, particularly the [Ser 8 ] -GnRH form. 

5 Example 5: Isolation of a cDNA encoding a treeahrew 
[Hip 5 , Trp 7 , Tyr 8 ] -QnRH pyeprohormone 

To isolate the sequence encoding the 
[His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone isoform in the treeshrew 
Tupaia bulangerl, we used a polymerase chain reaction (PCR) 

10 strategy similar to that used in the isolation of this isoform 
in a teleost fish, Haplochromis burtoni, as described Example 
1, herein. The amino acid sequence of the [ 5 His, 7 Trp, 
8 Try]GnRH decapeptide guided selection of primers for PCR 
amplification. Treeshrew brain mRNA was primed with a 

15 bipartite oligonucleotide for cDNA synthesis* The 3 1 domain 
of this oligonucleotide consisted of a homopolymer of d(T)24 
residue, while the 5' domain comprised a known sequence of 10 
nucleotides which included a restriction endonuclease 
recognition site. The resulting cDNA was used as substrate 

20 for nested PCR. The downstream primer in both rounds of 

amplification was bipartite. A randomly generated hexamer 
including all possible combinations of nucleotides at seven 
positions composed the 3* end and ten known nucleotides, 
including an Hind III restriction endonuclease recognition 

25 site, made up the 5' end. In the first PCR reaction, the 

upstream primer was a pool of 24 tetradecamers representing 
all possible coding sequences for the first five amino acids 
of [His 5 , Trp 7 , Tyr 8 ! -GnRH. The sequence of these primers is 
represented in the following two sequences : GCACGAATTCCA A/G 

30 CA T/C TGGTCNCA (Seq. ID No. 21) and GCACGAATTCCA A/G CA T/C 
TGGAG T/C CA (Seq. ID No. 22) . The product of the first PCR 
reaction served as substrate for nested reactions where the 
new upstream primer was bipartite. The 5' end of this primer 
contained a known restriction enzyme recognition sequence 

35 while the 3' end comprised a pool of 24 tetradecamers 

representing all possible coding combinations for residues 5-8 
of the decapeptide. These sequence this pool of primers is: 
GCACGAATTCCA T/C GGNTGGTA T/C CC (Seq. ID No. 23) . 
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PCR products were separated by electrophoresis in a 
2.5* GTG agarose gel (FMC) and products > 250 bases 
electrocuted and subcloned into M13 phage vector for sequence 
analysis (Bond et al., 1991). Nucleotide sequences were 
5 analyzed for the presence of the second upstream primer 
followed by the codons for the final amino acid of the 
decapeptide, Gly, and the next three amino acids Gly, Lys, and 
Arg. These residues follow the decapeptide sequence in the 
other cloned GnRH preprohormones and serve as substrates for 

10 posttranslational processing. One clone contained these 

landmarks followed by an open reading frame of 165 (or 219) 
nucleotides 55 (or 73) amino acids. The numbers in 
parentheses refer to the nucleotide and amino acid length if 
the alternate codon using methionine is used as the start 

15 codon. To confirm the identity of this clone as the 

[His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone , a downstream region of 
the clone was chosen from which to generate new primers for 5 1 
rapid amplification of cDNA ends (RACE) . See Frohman et al. 
in (1993) Methods in Enzymology Volume 218, Wu, R., editor, 

20 Academic Press, Inc., San Diego, California, USA for a more 
detailed description of RACE methodology. 

Substrate cDNA for the 5' RACE was synthesized from 
mRNA primed with randomly generated hexamers including all 
possible combinations of nucleotides at 7 positions. These 

25 cDNA products were tailed at their 5 1 ends with dATP in a 

terminal deaxytransf erase reaction (GIBCO BRL, Grand Island, 
New York, USA) . Two rounds of PCR amplification were 
performed on the tailed cDNA in which the 3' antisense primer 
consisted of a bipartite oligonucleotide (poly d(T)17 plus 

30 Hind III recognition site) designed to anneal to the 5 V 

terminal poly d(A)s. The outer 5' antisense primer for the 
first round of PCR consisted of a septadecamer corresponding 
to a region 111-127 nucleotides downstream of the decapeptide 
region. The oligonucleotide sequence for the outer primer was 

35 CAGGGCACACTGTCCTC (Seq. ID No. 24) . For the second round of 
amplification, the inner 5 f antisense primer was bipartite, 
composed of a known sequence of ten nucleotides including a 
restriction endonuclease cleavage site plus another 17 
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nucleotides 72-88 nucleotides downstream of the decapeptide 
region of the preprohormone candidate. The oligonucleotide 
sequence for the inner primer was GCACGAATTCCGAAGCTATGAGCAGTC 
(Seq. ID No. 25) . PCR products were prepared for sequencing 
5 (as above) and screened for the presence of the innermost 
primer followed by antisense sequences from the candidate 
clone and by antisense codons for the GnRH decapeptide. A 
majority of the generated clones were positive for the 
[His 5 ,Trp 7 ,Tyr 8 ] -GnRH encoding region and extended 
10 approximately 135 more nucleotides before ending in the poly 
d(T)/Hind III primer- sequence. Only one clone was isolated 
containing preprohormone sequence 3' to that region amplified 
by 5' RACE. 

15 Example 6: Sequence Analys is of the treeshrew cDNA encoding 
tfre fHis 5 ,Trp 7 f Tyx e 1 -GqRH pyeprghg^mone 

Only one clone was isolated in Example 5 which 
contained preprohormone sequence 3' to that region amplified 
by 5 1 RACE. To confirm the sequence, we designed a 

20 nondegenerate bipartite oligonucleotide whose 5' end consisted 
of ten nucleotides including a restriction endonuclease 
cleavage site and whose 3' end encoded residues 3-8 of the 
decapeptide region. The sequence of the oligonucleotide was 
GCJICX1AATTCGGTCCCACGGCTGGTAC (Seq. ID No. 26) . Poly d(T) 

25 primed cDNA was used as substrate in PCR amplification between 
the above oligonucleotide and a bipartite poly d(T)/Hind III 
primer downstream. Products were prepared, sequenced and 
screened as before resulting in many positive clones 
containing the same sequence as that found in the primary 

30 downstream clone and ending in polyadenylated tails. 

The fullest sequence of this preprohormone was 
obtained by aligning overlapping regions from at least ten 
consensus 5 1 RACE templates with at least ten consensus 
sequences generated in the first and second 3 s PCR 

35 amplifications. Sequences were aligned (Color Alignment 

Macros, M. Haygood) and differences attributed to PCR- error 
were assessed. The deduced amino acid sequence is shown in 
Seq. ID No. 18. As with GnRH preprohormones previously 
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isolated, a signal sequence precedes the decapeptide, followed 
by a conserved proteolytic processing site and an associated 
peptide. (See Comparative Biology and Evolutionary 
Relationships of Treeshrews, W.P. Luckett, ed. , Plenum Press, 
5 New York (I960).) Treeshrews are part of the super order 
Arcbanta, which includes primates, flying lemurs, bats and 
elephant shrews. Treeshrews are thus very closely related to 
primates, yet primitive. As a consequence, they are very 
frequently used to discover information about molecules, 
10 processes, etc., which may link much more primitive rodents 

with primates. Thus, [Ser 8 ] -GnRH preprohormone may be present 
in primates including humans. 

Example 7: Isolation of a cDNA e ncoding H. burtoni fSer 8 ] -GnRH 

15 preprohormone 

To obtain nucleotide sequences encoding a [Ser 8 ] - 
GnRH- like peptide from H. burtoni, a novel polymerase chain 
reaction (PGR) strategy using brain regions known to contain 
GnRH neuronal populations was employed (see Fig. 4 for a 

20 schematic view of H. burtoni brain) . Animals were sacrificed 
and whole brains removed as described in White, S.A. & 
Fernald, R.D. J* Neurosci. (1993) 13:434-441. tf. burtoni mRNA 
was isolated from the preoptic area, brain tissue known to 
contain the GnRH neuronal population which projects to the 

25 pituitary. cDNA synthesis was primed with a bipartite 

oligonucleotide whose 3 1 domain consisted of a homopolymer of 
d(T)9 residues, while the 5 1 domain comprised a known sequence 
of 14 nucleotides which included a restriction endonuclease 
recognition site. The resulting cDNA was used as substrate 

30 for nested PCR. In both rounds of amplification, the 

downstream primer was as described above. In the first PCR 
reaction, the 5 f end of the primer contained a known 
restriction enzyme recognition sequence and the 3 1 sequence 
was a mixture of 48 tetradecamers representing all possible 

35 coding sequences for the first five amino acids of [Ser 8 ] GnRH. 
(See Powell, J.F.F., et al. Regulatory Peptides, su bm itted.) 
The sequence of the this pool of oligonucleotides was as 
follows: GCACGAATTCCA A/G CA T/C TGGTCNTA T/C GG (Seq. ID No. 
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27) . The product of this reaction served as substrate for 
nested reactions where the new upstream primer was bipartite. 
The 5' end of this primer also contained a known restriction 
enzyme recognition sequence while the 3 1 end comprised a pool 
5 of 384 tetradecamers representing all possible coding 
combinations for residues 5-8 of the decapeptide. The 
sequences of these oligonucleotides are represented in the 
following two sequences: GCACGAATTCTA T/C GGN C/T TNTCNCC 
(Seq. ID No. 28) and GCACGAATTCTA T/C GGN C/T TNAG C/T CC 

10 (Seq. ID No. 29) . 

PCR products were separated by electrophoresis in a 
2% GTG agarose gel (EMC Bioproducts, Rockville, Maine, USA) 
and products >250 bases were electroeluted and subcloned into 
M13 phage vector for sequence analysis (see Bond et al. (1991) 

15 J. Mol. Endocrinol. , 5:931-937). Nucleotide sequences were 
analyzed for the presence of the second upstream primer 
followed by the codons for the final amino acid of the 
decapeptide, glycine, and the following three amino acid 
residues gly, lys, arg (see figure 2) . These residues follow 

20 the decapeptide sequence in the other cloned GnRH 

preprohormones and serve as substrates for posttranslational 
processing. Several independent clones from 3 separate 
first-round and 7 second-round PCR reactions contained these 
landmarks followed by an open reading frame of 201 nucleotides 

25 (67 amino acids) . To confirm the identity of these clones as 
the [Ser 8 ] -GnRH preprohonnone, templates were visually 
compared after alignment in a spreadsheet (Color Alignment 
Macros, M. Haygood, Scripps Institute of Oceanography, San 
Diego, California, USA) and a region of consensus was chosen 

30 from which to generate new primers for 5 ' rapid amplification 
of cDNA ends (RACE) . See Prohman et al. in (1993) Methods in 
Enzymology Volume 218, Wu, R. , editor, Academic Press, Inc., 
San Diego, California, USA for a more detailed description of 
RACE methodology. 

35 The bipartite primer used to make the substrate cDNA 

for the 5 1 RACE reactions consisted of a randomly generated 
hexamer including all possible combinations of nucleotides at 
7 positions. These cDNA products were tailed at their 5' ends 
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with dATP in a terminal deoxytransf erase reaction (GIBCO BRL, 
Grand Island, New York, USA) . Two rounds of PCR amplif ication 
were performed on the tailed cDNA in which the 3' antisense 
primer consisted of a bipartite oligonucleotide (poly d(T)17 
plus Bam HI recognition site), designed to anneal to the 5' 
terminal poly d(A)s. The outer 5' antisense primer for the 
first round of PCR consisted of 18 nucleotides corresponding 
to a region of high consensus amongst the candidate 
preprohormone templates. The sequence of this primer is 
CATATTGCCCAGTGTGTC (Seq. ID No. 30) . For the second round of 
an?>lif ication, the inner 5' antisense primer was bipartite, 
canqposed of a known sequence of 14 nucleotides including a 
restriction endonuclease cleavage site plus another 18 
nucleotides from the preprohormone candidate which were 3' to 
but did not overlap with the first selected sequence. The 
nucleotide sequence of this primer is 

GCAQ3AATT0GTGTCTGAGAAGTTGTCC (Seq. ID No. 31) . PCR products 
were prepared for sequencing as described above, and were 
screened for the presence of the innermost primer followed by 
antisense sequences from the candidate templates and by 
antisense codons for the GnRH decapeptide itself. 

Example 8: Sequenc e Analysis of the CDNA encoding ff, jflirtpni , 
TSer 8 T "GnRH prepro hormone 

Approximately 80% of the generated clones were 
positive for the [Ser 8 ] -GnRH- encoding region and extended 
approximately 140 more nucleotides before ending in the poly 
d(T)/BamHI primer sequence. The fullest sequence of this 
preprohormone was obtained by aligning overlapping regions 
from consensus 5' RACE templates with consensus sequences 
generated in the primary PCR amplification. Sequences were 
aligned (Color Alignment Macros) and differences attributed to 
PGR- error were assessed. The sequence of the full length cDNA 
is shown as Seq. ID No. 19 and the deduced amino acid sequence 
is shown as Seq. ID. No. 20. In each case, a signal sequence 
precedes one of the decapeptide forms, followed by a 
conserved proteolytic processing site and an associated 
peptide. 
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Example 9: Localization of fSer 8 1 -GnRH preprohormone mRNA 
expression in H. pur-hnni brain sections bv in situ 
hybriflizfrtApn 

The identity of the [Ser 8 ] -GnRH preprohormone 
5 obtained above as that encoding the GnRH contained within 
preoptic area GnRH-ir neurons which govern gonadotropin 
secretion from the pituitary was confirmed through in situ 
analysis on H. burton i brain sections. These neurons are 
known to enlarge or shrink in size depending on the social and 

10 reproductive state of the fish (see Davis & Fernald (1991) «7. 
Neurobiol. 21:1180-1188). Tissue sections were prepared from 
both a dominant and a subordinate animal, which contain either 
large or small GnRH immunoreactive (GnRH-ir) neurons, 
respectively. Social status was determined by behavioral 

15 observations. Animals were sacrificed, brains removed and 
tissue fixed as previously described in Example 4, herein. 
Cryostat sections (40/zxn) were mounted on poly (L- lysine) -coated 
slides for hybridization. 

A hybridization probe was generated by PCR 

20 amplification of a consensus 5 1 RACE product cloned into M13 
using lac and rev lac primers. Following gel electrophoresis 
performed as described in Example 7, herein, a single band of 
the correct size (ca. 210 bases) was electroeluted and 
fragments were subcloned into pSK+ (Stratagene, San Diego, 

25 California, USA) . Double- stranded sequencing confirmed that 

several clones contained the 5' RACE insert and one was chosen 
as a template for generation of an antisense digoxygenin - DTP 
(Boehringer Mannheim, Indianapolis, Indiana, USA) riboprdbe. 
The riboprobe was prepared as follows: The plasmid was 

30 linearized with BamHT (GIBCO/BRL, Grand Island, New York, USA) 
and purified using the Wizard clean-up system from Pr omega, 
Madison, Wisconsin, USA) . 300 ng was used for transcription 
of an RNA probe with T7 polymerase (GIBCO/BRL, Grand Island, 
New York, USA) and a lxNTP labelling mixture 

35 (Boehringer-Mannheim, Indianapolis, Indiana, USA) as described 
in Example 4, herein* 

Hybridizations were carried out at high stringency 
(60°C in 60% formamide) under conditions slightly modified 
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from those suggested by the digoxygenin manufacturer 
(Boehringer Mannheim, Indianapolis, Indiana, USA) . 150 fil of 
hybridization solution was applied to slides containing five 
to six brain sections. After hybridization, 2 washes were 
5 done in 2x SSC, followed by washes in 1.5x and lx SSC 

respectively at 60°C with a final wash at room temperature in 
lx SSC. All washes were for 20 minutes. Slides were 
dehydrated, coverslipped and viewed using a light microscope. 

Figure 5 shows a panel of midsagittal sections from 

10 H. burton! brain, focusing on the three regions which contain 
GnRH-ir cell populations (see Davis & Fernald (1990), supra). 
The top three regions were hybridized to the [Ser 8 ] -GnRH 
riboprobe as described above. Within this top tier, the first 
section is split, showing preoptic area cells labeled with the 

15 [Ser 8 ] -GnRH riboprobe from both a dominant and a subordinate 
male fish. They reveal that the [Ser 8 ] -GnRH transcript is 
expressed in the preoptic region within a cell population that 
exhibits neuronal size plasticity correlated with social 
state. The bottom two panels show hybridization of the 

20 riboprobe in the terminal nerve area and the mesencephalon 

regions of the H. burton! brain of dominant males. Thus, in 
H. burton!, the three different genes encoding GnRH are 
expressed in three distinct regions of the brain. 

While not wishing to be bound by theory, the in situ 

25 hybridization data supports an important reproductive role for 
the hypothalamic -preoptic form of GnRH, [Ser 8 ] -GnRH, in the 
cichlid fish H. burton!. In this species, preoptic GnRH-ir 
cells grow and shrink with social and reproductive state. 
Dominant males have large GnRH- producing cells, large testes, 

30 and are reproductively active. Subordinate males have small 
GnRH-producing cells, small testes and do not mate. Preoptic 
GnRH gene expression may therefore not only reflect the 
reproductive state of the animal, but may also be responsive 
to social cues. 

35 Also while not wishing to be bound by theory, the 

distribution of [His 5 , Trp 7 ,Tyr 8 ] -GnRH mRNA in H. Burton! 
brain, coupled with other information, suggests that this form 
of GnRH may coordinate reproductive behavior with reproductive 
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state. [His 5 , Trp 7 , Tyr 8 ] -GnRH is the most widespread and 
ancient GnRH isoform, having been immunologically detected in 
caudal brain regions of all non-mammalian vertebrates studied 
to date, often localized to a midbrain neuronal population. 
5 While in goldfish these cells project to the median eminence 
and result in the delivery of [His 5 , Trp 7 , Tyr 8 ] GnRH (in 
addition to the hypothalamic form) to the pituitary, this 
isoform is not present in the pituitary of H. burtoni. 
Neither is it widely detected in the pituitaries of other 

10 species* This fact stands in contrast to the marked efficacy 
of the [His 5 , Trp 7 , Tyr 8 ] GnRH isoform in causing release of 
gonadotropins in cultures of dispersed pituitary cells. Thus, 
while this isoform is capable of acting at pituitary GnRH 
receptors, it seldom has the in vivo opportunity to 

15 d emo nstrate its functional potency. However, because of this 
biological activity, nucleic acids encoding the 
[His 5 , Trp 7 , Tyr 8 ] GnRH precursor, as well as the nucleic acids 
encoding the [Ser 8 ] -GnRH precursor, may be useful in the 
production of transgenic animals, including transgenic fish. 

20 

Bygmple 9: Northern blot analysis of H. burton! brain tissue 

for tte rser 8 ] -gnRH p^curpc^ 

To determine the size of the [Ser 8 ] -GnRH mRNA 
precursor relative to the sizes of the other GnRH 

25 preprohormones in H. Jburtoni, Northern analyses were performed 
at high stringency as described in Example 3, herein. 
Poly (A) + RNA was isolated from ventral and dorsal brain 
regions using the PastTrak kit from Invitrogen Corporation, 
San Diego, California, 1.5 fig aliguots were loaded onto 

30 replicate lanes of a denaturing 2.5% agarose gel for size 
separation. The gel was prepared as a Northern blot using 
capillary transfer. Individual lanes were subjected to 
high- stringency hybridizations (50% formamide, 62°C) with 
riboprobes for each of the 3 GnRH preprohormones in Jf. 

35 burton 1 , either alone and in combination. Riboprobes were 

synthesized with T7 polymerase (GIBCO BRL, Grand Island, New 
York, USA), incorporating [a-32P]UTP (Amersham Corporation, 
Arlington Heights, Illinois, USA) to a specific activity of 3 
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X 108 dpm//ig. Hybridization mixtures contained 107 dpm per 
ml. The [Ser 8 ] -GnRH riboprobe labeled a band of approximately 
550 bases while the size of the [Trp 7 , Leu 8 ] GnRH and 
[His5,Trp7,Tyr8]GnRH transcripts are approximately 500 and 590 
5 bases , respectively . 

The above examples are provided to illustrate the 
invention but not to limit its scope. Other variants of the 
invention will be readily apparent to one of ordinary skill in 
10 the art and are encompassed by the appended claims. All 

publications, patents, and patent applications cited herein 
are hereby incorporated by reference. 



WO 95/12309 



59 



PCT/US94/12763 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: The Board of Trustees of Leland Stanford, Jr. 
University, 
And 

State of Oregon, acting by and through The 
Oregon State Board of Higher Education on behalf 
of The Oregon Health Sciences University 



(B) STREET: 900 Welch Road, Suite 350 

(C) CITY: Palo Alto 

(D) STATE: California 

(E) COUNTRY: U.S.A. 

(P) POSTAL CODE (ZIP): 94304-1858 

(G) TELEPHONE: (415) 723-0651 

(H) TELEFAX: (415) 725-7295 

(I) TELEX: 

(ii) TITLE OP INVENTION: Nucleic acids Encoding 

[His-5,Try-7,Tyr-8] -GnRH Preprohorxnone and 
[Ser-8] -GnRH preprohorxnone and 
Their Uses 

(iii) NUMBER OF SEQUENCES: 31 

(iv) CUMPITI'KU READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: WO not yet designated 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/147,771 

(B) FILING DATE: 05 -NOV- 19 9 3 

(vii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Weber, Kenneth A. 

(B) REGISTRATION NUMBER: 31,677 

(C REFERENCE /DOCKET NUMBER: 14210- 0004 00PC 

(viii) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 543-9600 

(B) TELEFAX: (415) 543-5043 



(2) INFORMATION FOR SBQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Haplochromis burtoni 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 135- . 389 

(ix) FEATURE: 

(A) NAME /KEY: mis cofeature 

(B) LOCATION: 243.. 389 

(D) OTHER INFORMATION: /notes "Nucleotide sequence that 
encodes the GAP peptide." 

(ix) FEATURE: 

(A) NAME/KEY: mi e cofeature 

(B) LOCATION: 1. .563 

(D) OTHER INFORMATION: /note= "cDNA sequence that contains 
nucleotide sequence that encodes 
GnRH{5-Hie, 7-Trp, 8 -Tyr} from Haplochromis 

(xi) SEQUENCE DESCRIPTION: SBQ ID N0:1: 

GACTGTCAGC GCAACTGGAT TTTAGCACIA AATCCACCAA AGGAAAAGAA CATTTTGAAG 60 

TGAACCCCTG CAGGTACACT GAGGGAAACT TGGACTGATA AAGCTGTGAA ATCTAAGACT 120 

AAGGTGGGAA TATC ATG TGT GTG TCT CGA CTG GCT TIG CTC TTG GGG CTG 170 
Met Cys Val Ser Arg Leu Ala Leu Leu Leu Gly Leu 
15 10 

CTT CTC TGT GTG GGG GCT CAG CTG TCC TTT GCC CAG CAC TGG TCC CAT 218 
Leu Leu Cys Val Gly Ala Gin Leu Ser Phe Ala Gin His Trp Ser His 
15 20 25 

GGT TGG TAT CCT GGA GGA AAA AGG GAG CTG GAC TCC TTT GGC ACA TCA 266 
Gly Trp Tyr Pro Gly Gly Lys Arg Glu Leu Asp Ser Phe Gly Thr Ser 
30 35 40 

GAG ATT TCA GAG GAG ATT AAG CTG TGT GAA GCA GGG GAA TGC AGC TAC 314 
Glu lie Ser Glu Glu lie Lys Leu Cys Glu Ala Gly Glu Cys Ser Tyr 
45 50 55 60 

CTG AGA CCC CAG AGG AGG AGT ATC CTG AGA AAC ATT CTT CTG GAT GCC 362 
Leu Arg Pro Gin Arg Arg Ser lie Leu Arg Asn lie Leu Leu Asp Ala 
65 70 75 

TTA GCC AGA GAG CTT CAG AAG AGA AAG TGACATCTTT CCAGAGCCTC 409 
Leu Ala Arg Glu Leu Gin Lys Arg Lys 
80 85 

TTTTCTATAG TAACCCACTT CCCTTTGTAT TTCTGCCTTG ACGTGATTTT GTGATCATCT 469 

GGCCr r UCTG TTTGTAATGT TTGTCAGTAA ATTTGTCCTG TTTTTTTCGA TGTGAAAATT 529 

G T GT CCCA AA ATAAATATCT ATTTTTATAT TAAA 563 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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<xi> SEQUENCE DESCRIPTION: SEQ ID NO. 2: 

Met Cye Val Ser Arg Leu Ala Leu Leu Leu Gly Leu Leu Leu Cys Val 
1 5 10 15 

Gly Ala Gin Leu Ser Phe Ala Gin His Trp Ser His Gly Trp Tyr Pro 
20 25 30 

Gly Gly Lys Arg Glu Leu Asp Ser Phe Gly Thr Ser Glu lie Ser Glu 
35 40 45 

Glu lie Lys Leu Cys Glu Ala Gly Glu Cys Ser Tyr Leu Arg Pro Gin 
50 55 60 

Arg Arg Ser lie Leu Arg Asn He Leu Leu Asp Ala Leu Ala Arg Glu 
65 70 75 80 

Leu Gin LyB Arg Lys 
85 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haplochromis burtoni 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCAT ION: 1. .90 

<D) OTHER INFORMATION: /note= "Name: 
PreproGnRH{7-Trp, 8 -Leu} . n 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:3: 

Met Glu Ala Gly Ser Arg Val He Met Gin Val Leu Leu Leu Ala Leu 
15 10 15 

Val Val Gin Val Thr Leu Ser Gin His Trp Ser Tyr Gly Trp Leu Pro 
20 25 30 

Gly Gly Lys Arg Ser Val Gly Glu Leu Glu Ala Thr He Arg Met Met 
35 40 45 

Gly Thr Gly Gly Val Val Ser Leu Pro Asp Glu Ala Asn Ala Gin He 
50 55 60 

Gin Glu Arg Leu Arg Pro Tyr Asn He He Asn Asp Asp Ser Ser His 
65 70 75 80 

Phe Asp Arg Lys Lys Arg Phe Pro Asn Asn 

85 9: 
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(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mammal 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .10 

(D) OTHER INFORMATION: /note= "GnRH" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 : 

Glu His Trp Ser Tyr Gly Leu Arg Pro Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chicken (I) 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 • . 10 

(D) OTHER INFORMATION: /note= "GnRH{8-Gln} 9 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 

Glu His Trp Ser Tyr Gly Leu Gin Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmon 

(ix) FEATURE: 

(A) HAME/KEY: Peptide 

(B) LOCAT ION: 1..10 

(D) OTHER INFORMATION: /notes n GnRH{7-Trp, 8 -Leu} 0 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 6: 

Glu His Trp Ser Tyr Gly Trp Leu Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Catfish 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..10 

(D) OTHER INFORMATION : /note= n GnRH{5-Hie, 8-Asn} 0 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Glu His Trp Ser His Gly Leu Asn Pro Gly 
15 10 

(2) INFORMATION FOR SEQ 3D NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
.(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chicken (II) 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .10 

(D) OTHER INFORMATION: /note= »GnRH{5-His, 7 -Trp, 8-Tyr}" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Glu His Trp Ser His Gly Trp Tyr Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) fSDUSCULB TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dogfish 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 
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(B) LOCATION: 1..10 

(D) OTHER INFORMATION : /note= °GnRH{5-His, 7-Trp, 8-Leu} " 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 9: 

Glu His Trp Ser His Gly Trp Leu Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

( i ) SEQUENCE CH ARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lamprey 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCAT ION: l.» 10 

(D) OTHER INFORMATION: /note= 

"GnRH{3-Tyr,5-Leu,6-Glu, 7-Trp, B-Lys}° 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Glu His Tyr Ser Leu Glu Trp Lys Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lamprey 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCAT I ON: 1.. 10 

(D) OTHER INFORMATION: /note= 

"GnRH{ 3 -Tyr , 5 -His , 6 -Asp , 7 -Tip , 8 -Lys } ° 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 

Glu His Tyr Ser His Asp Trp Lys Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alpha -yeas t mating factor 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

His Trp Leu Glu Leu Lys Pro Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 
CARCAYTGGT CNCA 

(2) INFORMATION FOR SEQ ID NO:14: 

( i ) SEQUEN CE CH ARACTERISTICS : 

(A) LENGTH: 14 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 
CAYGGNTGGT AYCC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENG TH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCAGAAGCTT CAGCT 
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(2) INFORMATION FOR SEQ ID NO: 16 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGGAATGCAG CTACCTGAGA CCCCAGAGG 29 
(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCAT ION: 72.. 338 

(D) OTHER INFORMATION: /products "The 

[His-5,Trp-7 # Tyr-8] -GnRH prepronormone from 
Tree shrew" 

(ix) FEATURE: 

(A) NAME /KEY: mi s cofeature 

(B) LOCATION: 60.. 62 

(D) OTHER INFORMATION: /note= "Encodes potential second 
initiator methionine." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CTCGAGCCAG AAGGAGCCAC CCZACCTAXA GCCTCTTCCA CIGCTGTCCC CGTCCI6CCA 60 

TGGCCAGTTC C ATG CTG GGC TTC CTC CTC CTG CTG CTG CTG CTG ATG GCT 110 
Met Leu Gly Phe Leu Leu Leu Leu Leu Leu Leu Met Ala 
15 10 

GCC GAC CCT GGA CCC TOG GAG GCC CAG CAT TGG TCC CAC GGC TGG TAC 158 
Ala His Pro Gly Pro Ser Glu Ala Gin His Trp Ser His Gly Trp Tyr 
15 20 25 

CCT GGA GGA AAG CGA GCC TCC AAC TCA CCC CAG GAC CCT CAA AGT GCC 206 
Pro Gly Gly Lys Arg Ala Ser Asn Ser Pro Gin Asp Pro Gin Ser Ala 
30 35 40 45 

CTT AGG CCC CCA GCC CCC AGC GCA GCC AGA CTG CTC ATA GCT TCC GAA 254 
Leu Arg Pro Pro Ala Pro Ser Ala Ala Arg Leu Leu lie Ala Ser Glu 
50 55 60 

GCG CTG CTC TGG CTT CCC CCG AGG ACA GTG TGC CCT GGG AGG GCA GGA 302 
Ala Leu Leu Trp lieu Pro Pro Arg Thr Val Cys Pro Gly Arg Ala Gly 
65 70 75 
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CCA CAG CAG GAT 6GT CTC TCC GCA GGA AGC AHr ACC TGATGCGGAC 348 
Pro Oln Gin Asp Gly Leu Ser Ala Gly Ser s.-. ™hr 
80 85 

ACTGCTGAGC GCAGCCGGAG CGCCGCGCCC CGCCGCCGTL JCAATAAAGC CGTGAGATTC 408 

CCGAAAAAAA AAAA 422 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 89 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Leu Gly Phe Leu Leu Leu Leu Leu Leu Leu Met Ala Ala His Pro 
1 5 10 15 

Gly Pro Ser Glu Ala Gin His Trp Ser His Gly Trp Tyr Pro Gly Gly 
20 25 30 

Lys Arg Ala Ser Asn Ser Pro Gin Asp Pro Gin Ser Ala Leu Arg Pro 
35 40 45 

Pro Ala Pro Ser Ala Ala Arg Leu Leu lie Ala Ser Glu Ala Leu Leu 
50 55 60 

Trp Leu Pro Pro Arg Thr Val Cys Pro Gly Arg Ala Gly Pro Gin Gin 
65 70 75 80 

Asp Gly Leu Ser Ala Gly Ser Ser Thr 
85 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUE NCE CH ARACTERISTICS: 

(A) LENGTH: 491 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 71.. 364 

(D) OTHER INFORMATION: /products "The [Ser- 8] -GnRH 
preprohormone from H. burtoni" 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 47.. 49 

(D) OTHER INFORMATION: /note= "Encodes potential second 
initiator methionine . 0 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TTAGACTTCA CAAAGGACAG CAGAAGAGAT CAGAAGTTCT TGTCTAATGC ACAGAAGCTT 60 
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TATCCTCAGA ATG GCT GCA AAA ATC TTG GCA CTG TGG CTG CTG CTC GCA 109 

Met Ala Ala Lys He Leu Ala Leu Trp Leu Leu Leu Ala 

15 10 

GGG ACG GTG TTT CCA CAG GGC TGC TGT CAG CAC TGG TCA TAC GGA CTG 157 
Gly Thr Val Phe Pro Gin Gly Cys Cye Gin His Trp Ser Tyr Gly Leu 
15 20 25 

AGC CCA GGA GGG AAG AGG GAT CTG GAC AAC TTC TCA GAC ACA CTG GGC 205 
Ser Pro Gly Gly Lys Arg Asp Leu Asp Asn Phe Ser Asp Thr Leu Gly 
30 35 40 45 

AAT ATG GTT GAA GAG TTC CCA CGC GTC GAA GCA CCT TGC AGT GTT TTC 253 
Asn Met Val Glu Glu Phe Pro Arg Val Glu Ala Pro Cys Ser Val Phe 
50 55 60 

GGT TGT GCA GAG GAA TCA CCT TTT GCC AAA ATG TAC AGA GTG AAA GGA 301 
Gly Cys Ala Glu Glu Ser Pro Phe Ala Lys Met Tyr Arg Val Lys Gly 
65 70 75 

CTT CTT GCG AGT GTG GCC GAA AGG AAA ATG GAC ACC GGA CAT TCA AGA 349 
Leu Leu Ala Ser Val Ala Glu Arg Lys Met Asp Thr Gly His Ser Arg 
80 85 90 

AAT GAA AGA TTT CTT TGATTCTACA TTTCATTTTT TATATGAGCA TAAAACATTT 404. 
Asn Glu Arg Phe Leu 
95 

TTTTGTGAAT GTTGCTCTTG TCTTATTATC TAAAATATAA ATAAAAGCTT TCAACTCACT 464 
GAAAAAAAAA AAAAAAAAAA AAAAACC 491 



(2) INFORMATION FOR SEQ XD NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Met Ala Ala Lys He Leu Ala Leu Trp Leu Leu Leu Ala Gly Thr Val 
15 10 15 

Phe Pro Gin Gly Cys Cys Gin His Trp Ser Tyr Gly Leu Ser Pro Gly 
20 25 30 

Gly Lys Arg Asp Leu Asp Asn Phe Ser Asp Thr Leu Gly Asn Met Val 
35 40 45 

Glu Glu Phe Pro Arg Val Glu Ala Pro Cys Ser Val Phe Gly Cys Ala 
50 55 60 

Glu Glu Ser Pro Phe Ala Lys Met Tyr Arg Val Lys Gly Leu Leu Ala 
65 70 - 75 80 

Ser Val Ala Glu Arg Lys Met Asp Thr Gly His Ser Arg Asn Glu Arg 
85 90 95 



Phe Leu 
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(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCACGAATTC CARCAYTGGT CNCA 24 
(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GCACGAATTC CARCAYTGGA GYCA 24 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GCACGAATTC CAYGGNTGGT AYCC 24 
(2) INFORMATION FOR SEQ ID NO:_4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CAGGGCACAC TGTCCTC 



17 
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(2) INFORMATION FOR SEQ ID 190:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
GCACGAATTC CGAAGCTATG AGCAGTC 
(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
GCACGAATTC GGTCCCACGG CTGGTAC 
(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 27 
GCACGAATTC CARCAYTGGT CRTAYGG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
GCACGAATTC TAYGGNYTNT CNCC 
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(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GCACGAATTC TAYGGNYTNA GYCC 
(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
CAXATTGCCC AGTGTGTC 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 31 
GCACGAATTC GTGTCTGAGA AGTTGTCC 
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1 l. An isolated nucleic acid encoding a vertebrate 

2 [His 5 , Trp 7 , Tyr 8 ] - GnRH preprohormone * 

1 2. A composition according to claim 1 wherein said 

2 nucleic acid encodes a full-length [His 5 , Trp 7 , Tyr 8 ] -GnRH 

3 preprohormone . 

1 3. A composition according to claim 1 wherein said 

2 preprohormone consists of the amino acid sequence depicted in 

3 Seq. ID No. 2. 

1 4. A composition according to claim 1 wherein said 

2 preprohormone is of fish origin. 

1 5, An isolated nucleic acid encoding a vertebrate 

2 [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone GAP peptide. 

1 6. A composition according to claim 5 wherein said 

2 nucleic acid encodes a full-length [His 5 , Trp 7 , Tyr 8 ] -GnRH 

3 preprohormone GAP peptide. 

1 7. A composition according to claim 6 wherein said 

2 nucleic acid encodes the GAP peptide as shown in Figure 2. 

1 8. An isolated vertebrate [His 5 , Trp 7 , Tyr 8 ] -GnRH 

2 preprohormone . 

1 9. A composition according to claim 8 wherein said 

2 preprohormone is specifically immunoreactive with antibodies 

3 raised against an immunogen consisting essentially of a 

4 polypeptide of Seq ID No. 2. 

1 10. A composition according to claim 8 wherein said 

2 preprohormone is recambinantly produced. 
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A composition according to claim 8 wherein said 
is of fish origin. 



1 12. A composition according to claim 8 wherein said 

2 preprohormone is full-length, 

1 13 . An isolated vertebrate [His 5 , Trp 7 , Tyr 8 ] -GnRH 

2 preprohormone GAP peptide. 



14. A composition according to claim 13 wherein 
said GAP peptide is competent to bind antibodies raised 
against an immunogen consisting essentially of a GAP peptide 
as shown in Figure 2. 



1 15. A composition according to claim 13 wherein 

2 said GAP peptide is recombinantly produced. 

1 16. A composition according to claim 13 wherein 

2 said GAP peptide is of fish origin. 

1 17. A composition according to claim 13 wherein 

2 said GAP peptide is full-length. 

1 18. An antibody specifically immunoreactive with a 

2 vertebrate [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone . 

1 19. A composition according to claim 18 wherein 

2 said antibody is specifically immunoreactive with a vertebrate 

3 [His 5 , Trp 7 , Tyr 8 ] -GnRH preprohormone GAP peptide. 

1 20. A nucleic acid probe capable of selectively 

2 hybridizing to a nucleic acid encoding a vertebrate GnRH 

3 preprohormone . 

1 21. The composition of claim 20 wherein said 

2 nucleic acid consists of Seq. ID No. 1. 
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1 22. The composition of claim 20 wherein said 

2 nucleic acid consists of a GAP peptide cDNA sequence as shown 

3 in Seq. ID No. 1. 

1 23. A transgenic non- human animal having increased 

2 reproductive capacity due to the expression of a DNA construct 

3 encoding a vertebrate [His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone 

4 introduced into said animal or an ancestor of said animal. 

1 24. The non-human animal of claim 23 wherein said 

2 animal is a fish. 

1 25. A method of detecting a vertebrate 

2 [His 5 ,Trp 7 #Tyr' 8 ] -GnRH preprohormone in a biological sample 

3 comprising the steps of: 

4 a) contacting said biological sample with an 

5 antibody specifically immunoreactive with a vertebrate 

6 [His 5 ,Trp 7 ,Tyr 8 ] -GnRH preprohormone; 

7 b) incubating said antibody with said biological 

8 sanple to form an antibody: [His 5 , Trp 7 ,Tyr 8 ] -GnRH preprohormone 

9 complex; and 

10 c) detecting said complex. 

1 26. A method according to claim 25 wherein said 

2 biological sample is of fish origin. 

1 27. A method of detecting a nucleic acid encoding 

2 a vertebrate [His 5 ,Trp 7 , r iyr 8 ] -GnRH preprohormone in a 

3 biological sample comprising: 

4 a) contacting said biological sample with a nucleic 

5 acid probe capable of selectively hybridizing to said nucleic 

6 acid encoding a vertebrate [His 5 , Trp 7 ,Tyr 8 ] -GnRH 

7 preprohormone ; 

8 b) incubating said nucleic acid probe with the 

9 biological sample to form a hybrid of the nucleic acid probe 

10 with complementary nucleic acid sequences present in the 

11 biological sample; and 
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12 c) determining the extent of hybridization of the 

13 nucleic acid probe to the complementary nucleic acid 

14 sequences . 

1 28. A method according to claim 27 wherein said 

2 biological sample is of fish origin. 

1 29. A composition according to claim 1 wherein said 

2 preprohormone is of mammalian origin. 

1 30. A composition according to claim 1 wherein said 

2 preprohormone consists of the amino acid sequence depicted in 

3 Seq. ID No. 18. 

1 31. A composition according to claim 8 wherein said 

2 preprohormone is of mammalian origin. 

1 32. A composition according to claim 8 wherein said 

2 preprohormone consists of the amino acid sequence depicted in 

3 Seq. ID No. 18. 

1 33. An isolated nucleic acid encoding a vertebrate 

2 [Ser 8 ] -GnRH preprohormone. 

1 34. A composition according to claim 33 wherein 

2 said preprohormone is of fish origin. 

1 35. A composition according to claim 33 wherein 

2 said preprohormone consists of the amino acid sequence 

3 depicted in Seq. ID No. 20. 

1 36* An isolated vertebrate [Ser 8 J -GnRH 

2 preprohormone . 



1 37. A composition according to claim 36 wherein 

2 said preprohormone is of fish origin. 
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1 38. A composition according to claim 36 wherein 

2 said preprohormone consists of the amino acid sequence 

3 depicted in Seq. ID No. 20. 
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Fig. 2: 

-20 -10 +1 10 

GnRH{ 7 Trp 8 Leu): MEAGSRVIMQVLLLALWQVTLS QHWSYGWLPG GKR 

GnRH{5His 7 Trp8Tyr»: MCVSRLALLLGLLLCVGAQLSFA QHWSHGWYPG GKR 

20 30 40 50 60 

SVGEIJEATIRMMGTGGVVSLPDEANAQIQER 
EU)SFGTSEISEEIiaCEAGECSYIJlPQEESIIJlNILIJDAIARELQKRK* 
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